Dataset statistics
| Number of variables | 66 |
|---|---|
| Number of observations | 182242 |
| Missing cells | 3131080 |
| Missing cells (%) | 26.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 91.8 MiB |
| Average record size in memory | 528.0 B |
Variable types
| Numeric | 23 |
|---|---|
| Text | 14 |
| Categorical | 27 |
| Boolean | 1 |
| Unsupported | 1 |
BLDG_SEQ has constant value "1" | Constant |
SFYI_VALUE has constant value "0" | Constant |
AC_TYPE is highly overall correlated with COM_UNITS and 4 other fields | High correlation |
BDRM_COND is highly overall correlated with GROSS_AREA and 3 other fields | High correlation |
BED_RMS is highly overall correlated with FULL_BTH and 5 other fields | High correlation |
BTHRM_STYLE1 is highly overall correlated with BTHRM_STYLE2 and 8 other fields | High correlation |
BTHRM_STYLE2 is highly overall correlated with BTHRM_STYLE1 and 7 other fields | High correlation |
BTHRM_STYLE3 is highly overall correlated with BTHRM_STYLE1 and 9 other fields | High correlation |
CD_FLOOR is highly overall correlated with KITCHEN_STYLE2 and 2 other fields | High correlation |
CITY is highly overall correlated with CM_ID and 4 other fields | High correlation |
CM_ID is highly overall correlated with CITY and 5 other fields | High correlation |
COM_UNITS is highly overall correlated with AC_TYPE and 6 other fields | High correlation |
CORNER_UNIT is highly overall correlated with GROSS_AREA and 5 other fields | High correlation |
EXT_FNISHED is highly overall correlated with NUM_PARKING and 1 other fields | High correlation |
FULL_BTH is highly overall correlated with BED_RMS and 2 other fields | High correlation |
GIS_ID is highly overall correlated with CITY and 4 other fields | High correlation |
GROSS_AREA is highly overall correlated with AC_TYPE and 19 other fields | High correlation |
HEAT_SYSTEM is highly overall correlated with COM_UNITS and 5 other fields | High correlation |
HEAT_TYPE is highly overall correlated with COM_UNITS and 4 other fields | High correlation |
INT_COND is highly overall correlated with BTHRM_STYLE1 and 4 other fields | High correlation |
INT_WALL is highly overall correlated with GROSS_AREA and 1 other fields | High correlation |
KITCHENS is highly overall correlated with BED_RMS and 4 other fields | High correlation |
KITCHEN_STYLE1 is highly overall correlated with BTHRM_STYLE1 and 8 other fields | High correlation |
KITCHEN_STYLE2 is highly overall correlated with BTHRM_STYLE1 and 11 other fields | High correlation |
KITCHEN_STYLE3 is highly overall correlated with BDRM_COND and 13 other fields | High correlation |
KITCHEN_TYPE is highly overall correlated with GROSS_AREA and 4 other fields | High correlation |
LIVING_AREA is highly overall correlated with AC_TYPE and 18 other fields | High correlation |
LU is highly overall correlated with CD_FLOOR and 9 other fields | High correlation |
LUC is highly overall correlated with BTHRM_STYLE1 and 9 other fields | High correlation |
NUM_PARKING is highly overall correlated with BDRM_COND and 5 other fields | High correlation |
ORIENTATION is highly overall correlated with GROSS_AREA and 5 other fields | High correlation |
OWN_OCC is highly overall correlated with COM_UNITS and 3 other fields | High correlation |
PID is highly overall correlated with CITY and 4 other fields | High correlation |
PROP_VIEW is highly overall correlated with COM_UNITS and 1 other fields | High correlation |
RC_UNITS is highly overall correlated with AC_TYPE and 6 other fields | High correlation |
RES_FLOOR is highly overall correlated with BED_RMS and 5 other fields | High correlation |
RES_UNITS is highly overall correlated with AC_TYPE and 4 other fields | High correlation |
ROOF_COVER is highly overall correlated with ROOF_STRUCTURE | High correlation |
ROOF_STRUCTURE is highly overall correlated with ROOF_COVER | High correlation |
STRUCTURE_CLASS is highly overall correlated with COM_UNITS and 3 other fields | High correlation |
TT_RMS is highly overall correlated with BED_RMS and 5 other fields | High correlation |
YR_BUILT is highly overall correlated with BTHRM_STYLE3 and 3 other fields | High correlation |
ZIP_CODE is highly overall correlated with CITY and 4 other fields | High correlation |
_id is highly overall correlated with CITY and 4 other fields | High correlation |
NUM_BLDGS is highly imbalanced (99.9%) | Imbalance |
INT_WALL is highly imbalanced (88.5%) | Imbalance |
OVERALL_COND is highly imbalanced (69.8%) | Imbalance |
BDRM_COND is highly imbalanced (61.9%) | Imbalance |
PROP_VIEW is highly imbalanced (62.3%) | Imbalance |
CM_ID has 88951 (48.8%) missing values | Missing |
ST_NUM has 9363 (5.1%) missing values | Missing |
UNIT_NUM has 99629 (54.7%) missing values | Missing |
BLDG_TYPE has 2616 (1.4%) missing values | Missing |
MAIL_ADDRESSEE has 147830 (81.1%) missing values | Missing |
RES_FLOOR has 33792 (18.5%) missing values | Missing |
CD_FLOOR has 110270 (60.5%) missing values | Missing |
RES_UNITS has 171474 (94.1%) missing values | Missing |
COM_UNITS has 171474 (94.1%) missing values | Missing |
RC_UNITS has 171474 (94.1%) missing values | Missing |
LAND_SF has 8002 (4.4%) missing values | Missing |
GROSS_AREA has 33848 (18.6%) missing values | Missing |
LIVING_AREA has 34141 (18.7%) missing values | Missing |
YR_BUILT has 22786 (12.5%) missing values | Missing |
YR_REMODEL has 95524 (52.4%) missing values | Missing |
STRUCTURE_CLASS has 164836 (90.4%) missing values | Missing |
ROOF_STRUCTURE has 36225 (19.9%) missing values | Missing |
ROOF_COVER has 36219 (19.9%) missing values | Missing |
INT_WALL has 48749 (26.7%) missing values | Missing |
EXT_FNISHED has 22884 (12.6%) missing values | Missing |
INT_COND has 48746 (26.7%) missing values | Missing |
EXT_COND has 36158 (19.8%) missing values | Missing |
OVERALL_COND has 9587 (5.3%) missing values | Missing |
BED_RMS has 48765 (26.8%) missing values | Missing |
FULL_BTH has 11644 (6.4%) missing values | Missing |
HLF_BTH has 11509 (6.3%) missing values | Missing |
KITCHENS has 11718 (6.4%) missing values | Missing |
TT_RMS has 48829 (26.8%) missing values | Missing |
BDRM_COND has 110500 (60.6%) missing values | Missing |
BTHRM_STYLE1 has 49548 (27.2%) missing values | Missing |
BTHRM_STYLE2 has 97077 (53.3%) missing values | Missing |
BTHRM_STYLE3 has 145740 (80.0%) missing values | Missing |
KITCHEN_TYPE has 49555 (27.2%) missing values | Missing |
KITCHEN_STYLE1 has 49549 (27.2%) missing values | Missing |
KITCHEN_STYLE2 has 150994 (82.9%) missing values | Missing |
KITCHEN_STYLE3 has 168497 (92.5%) missing values | Missing |
HEAT_TYPE has 48242 (26.5%) missing values | Missing |
HEAT_SYSTEM has 110013 (60.4%) missing values | Missing |
AC_TYPE has 48272 (26.5%) missing values | Missing |
FIREPLACES has 49534 (27.2%) missing values | Missing |
ORIENTATION has 110268 (60.5%) missing values | Missing |
NUM_PARKING has 48623 (26.7%) missing values | Missing |
PROP_VIEW has 46953 (25.8%) missing values | Missing |
CORNER_UNIT has 110271 (60.5%) missing values | Missing |
COM_UNITS is highly skewed (γ1 = 61.02897174) | Skewed |
RC_UNITS is highly skewed (γ1 = 59.88420383) | Skewed |
GROSS_AREA is highly skewed (γ1 = 55.89596558) | Skewed |
LIVING_AREA is highly skewed (γ1 = 63.65241616) | Skewed |
YR_BUILT is highly skewed (γ1 = 146.1067555) | Skewed |
YR_REMODEL is highly skewed (γ1 = 248.0727809) | Skewed |
NUM_PARKING is highly skewed (γ1 = 29.67993455) | Skewed |
_id is uniformly distributed | Uniform |
_id has unique values | Unique |
MAIL_ZIP_CODE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
CD_FLOOR has 8077 (4.4%) zeros | Zeros |
COM_UNITS has 10131 (5.6%) zeros | Zeros |
RC_UNITS has 10731 (5.9%) zeros | Zeros |
BED_RMS has 3184 (1.7%) zeros | Zeros |
FULL_BTH has 36940 (20.3%) zeros | Zeros |
HLF_BTH has 135661 (74.4%) zeros | Zeros |
KITCHENS has 36863 (20.2%) zeros | Zeros |
FIREPLACES has 96980 (53.2%) zeros | Zeros |
NUM_PARKING has 58524 (32.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-09-12 18:37:28.999228 |
|---|---|
| Analysis finished | 2024-09-12 18:38:49.866347 |
| Duration | 1 minute and 20.87 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
_id
Real number (ℝ)
HIGH CORRELATION  UNIFORM  UNIQUE 
| Distinct | 182242 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 91121.5 |
| Minimum | 1 |
|---|---|
| Maximum | 182242 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9113.05 |
| Q1 | 45561.25 |
| median | 91121.5 |
| Q3 | 136681.75 |
| 95-th percentile | 173129.95 |
| Maximum | 182242 |
| Range | 182241 |
| Interquartile range (IQR) | 91120.5 |
Descriptive statistics
| Standard deviation | 52608.878 |
|---|---|
| Coefficient of variation (CV) | 0.57734869 |
| Kurtosis | -1.2 |
| Mean | 91121.5 |
| Median Absolute Deviation (MAD) | 45560.5 |
| Skewness | 0 |
| Sum | 1.6606164 × 1010 |
| Variance | 2.7676941 × 109 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 1 | 1 | < 0.1% |
| 121466 | 1 | < 0.1% |
| 121490 | 1 | < 0.1% |
| 121491 | 1 | < 0.1% |
| 121492 | 1 | < 0.1% |
| 121493 | 1 | < 0.1% |
| 121494 | 1 | < 0.1% |
| 121495 | 1 | < 0.1% |
| 121496 | 1 | < 0.1% |
| 121497 | 1 | < 0.1% |
| Other values (182232) | 182232 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 182242 | 1 | |
| 182241 | 1 | |
| 182240 | 1 | |
| 182239 | 1 | |
| 182238 | 1 | |
| 182237 | 1 | |
| 182236 | 1 | |
| 182235 | 1 | |
| 182234 | 1 | |
| 182233 | 1 |
PID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 182235 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1400927 × 109 |
| Minimum | 1.00001 × 108 |
|---|---|
| Maximum | 2.20567 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1.00001 × 108 |
|---|---|
| 5-th percentile | 1.0619905 × 108 |
| Q1 | 5.0158801 × 108 |
| median | 1.1026175 × 109 |
| Q3 | 1.810508 × 109 |
| 95-th percentile | 2.102473 × 109 |
| Maximum | 2.20567 × 109 |
| Range | 2.105669 × 109 |
| Interquartile range (IQR) | 1.30892 × 109 |
Descriptive statistics
| Standard deviation | 7.0911136 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.62197691 |
| Kurtosis | -1.5581055 |
| Mean | 1.1400927 × 109 |
| Median Absolute Deviation (MAD) | 7.0156449 × 108 |
| Skewness | 0.034817957 |
| Sum | 2.0777278 × 1014 |
| Variance | 5.0283892 × 1017 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 2100710002 | 2 | < 0.1% |
| 1404289000 | 2 | < 0.1% |
| 602671206 | 2 | < 0.1% |
| 203102008 | 2 | < 0.1% |
| 501706002 | 2 | < 0.1% |
| 1300783000 | 2 | < 0.1% |
| 1603979000 | 2 | < 0.1% |
| 1702873000 | 1 | < 0.1% |
| 1702866000 | 1 | < 0.1% |
| 1702867000 | 1 | < 0.1% |
| Other values (182225) | 182225 |
| Value | Count | Frequency (%) |
| 100001000 | 1 | |
| 100002000 | 1 | |
| 100003000 | 1 | |
| 100004000 | 1 | |
| 100005000 | 1 | |
| 100006000 | 1 | |
| 100007000 | 1 | |
| 100008000 | 1 | |
| 100009000 | 1 | |
| 100010000 | 1 |
| Value | Count | Frequency (%) |
| 2205670000 | 1 | |
| 2205669000 | 1 | |
| 2205668000 | 1 | |
| 2205667000 | 1 | |
| 2205666000 | 1 | |
| 2205665004 | 1 | |
| 2205665002 | 1 | |
| 2205665000 | 1 | |
| 2205664000 | 1 | |
| 2205663001 | 1 |
CM_ID
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 10770 |
|---|---|
| Distinct (%) | 11.5% |
| Missing | 88951 |
| Missing (%) | 48.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.1756087 × 108 |
| Minimum | 1.00018 × 108 |
|---|---|
| Maximum | 2.205665 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1.00018 × 108 |
|---|---|
| 5-th percentile | 2.00204 × 108 |
| Q1 | 3.06906 × 108 |
| median | 6.026424 × 108 |
| Q3 | 1.602331 × 109 |
| 95-th percentile | 2.102331 × 109 |
| Maximum | 2.205665 × 109 |
| Range | 2.105647 × 109 |
| Interquartile range (IQR) | 1.295425 × 109 |
Descriptive statistics
| Standard deviation | 6.8964632 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.75160825 |
| Kurtosis | -1.0506641 |
| Mean | 9.1756087 × 108 |
| Median Absolute Deviation (MAD) | 3.002954 × 108 |
| Skewness | 0.71010661 |
| Sum | 8.5600171 × 1013 |
| Variance | 4.7561204 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 300450000 | 846 | 0.5% |
| 300475000 | 816 | 0.4% |
| 304850000 | 685 | 0.4% |
| 306010010 | 538 | 0.3% |
| 304590010 | 443 | 0.2% |
| 306455010 | 425 | 0.2% |
| 401149020 | 412 | 0.2% |
| 203506010 | 370 | 0.2% |
| 2205550001 | 355 | 0.2% |
| 2101925000 | 339 | 0.2% |
| Other values (10760) | 88062 | |
| (Missing) | 88951 |
| Value | Count | Frequency (%) |
| 100018000 | 5 | |
| 100019000 | 4 | |
| 100024000 | 4 | |
| 100041000 | 5 | |
| 100046000 | 3 | |
| 100109000 | 4 | |
| 100141000 | 4 | |
| 100145000 | 4 | |
| 100153000 | 5 | |
| 100154000 | 5 |
| Value | Count | Frequency (%) |
| 2205665000 | 3 | < 0.1% |
| 2205642000 | 3 | < 0.1% |
| 2205629000 | 3 | < 0.1% |
| 2205589000 | 3 | < 0.1% |
| 2205550001 | 355 | |
| 2205525000 | 5 | < 0.1% |
| 2205523000 | 94 | 0.1% |
| 2205511000 | 3 | < 0.1% |
| 2205474000 | 9 | < 0.1% |
| 2205464000 | 4 | < 0.1% |
GIS_ID
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 98531 |
|---|---|
| Distinct (%) | 54.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.1400938 × 109 |
| Minimum | 1.00001 × 108 |
|---|---|
| Maximum | 2.20567 × 109 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1.00001 × 108 |
|---|---|
| 5-th percentile | 1.0619905 × 108 |
| Q1 | 5.0158825 × 108 |
| median | 1.1026175 × 109 |
| Q3 | 1.810508 × 109 |
| 95-th percentile | 2.102473 × 109 |
| Maximum | 2.20567 × 109 |
| Range | 2.105669 × 109 |
| Interquartile range (IQR) | 1.3089198 × 109 |
Descriptive statistics
| Standard deviation | 7.0911226 × 108 |
|---|---|
| Coefficient of variation (CV) | 0.62197712 |
| Kurtosis | -1.5581098 |
| Mean | 1.1400938 × 109 |
| Median Absolute Deviation (MAD) | 7.015645 × 108 |
| Skewness | 0.03481601 |
| Sum | 2.0777298 × 1014 |
| Variance | 5.028402 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 300450000 | 865 | 0.5% |
| 300475000 | 848 | 0.5% |
| 304850000 | 701 | 0.4% |
| 306010010 | 539 | 0.3% |
| 306455010 | 462 | 0.3% |
| 602642000 | 454 | 0.2% |
| 304590010 | 444 | 0.2% |
| 401149010 | 415 | 0.2% |
| 203506010 | 370 | 0.2% |
| 2205550001 | 355 | 0.2% |
| Other values (98521) | 176789 |
| Value | Count | Frequency (%) |
| 100001000 | 1 | |
| 100002000 | 1 | |
| 100003000 | 1 | |
| 100004000 | 1 | |
| 100005000 | 1 | |
| 100006000 | 1 | |
| 100007000 | 1 | |
| 100008000 | 1 | |
| 100009000 | 1 | |
| 100010000 | 1 |
| Value | Count | Frequency (%) |
| 2205670000 | 1 | < 0.1% |
| 2205669000 | 1 | < 0.1% |
| 2205668000 | 1 | < 0.1% |
| 2205667000 | 1 | < 0.1% |
| 2205666000 | 1 | < 0.1% |
| 2205665000 | 3 | |
| 2205664000 | 1 | < 0.1% |
| 2205663001 | 1 | < 0.1% |
| 2205663000 | 1 | < 0.1% |
| 2205662020 | 1 | < 0.1% |
ST_NUM
Real number (ℝ)
MISSING 
| Distinct | 2753 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 9363 |
| Missing (%) | 5.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 226.4317 |
| Minimum | 0 |
|---|---|
| Maximum | 5341 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 24 |
| median | 68 |
| Q3 | 212 |
| 95-th percentile | 945 |
| Maximum | 5341 |
| Range | 5341 |
| Interquartile range (IQR) | 188 |
Descriptive statistics
| Standard deviation | 475.70457 |
|---|---|
| Coefficient of variation (CV) | 2.1008745 |
| Kurtosis | 40.156888 |
| Mean | 226.4317 |
| Median Absolute Deviation (MAD) | 56 |
| Skewness | 5.4182895 |
| Sum | 39145285 |
| Variance | 226294.83 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2890 | 1.6% |
| 15 | 2685 | 1.5% |
| 2 | 2633 | 1.4% |
| 10 | 2456 | 1.3% |
| 6 | 2339 | 1.3% |
| 9 | 2315 | 1.3% |
| 11 | 2129 | 1.2% |
| 8 | 2061 | 1.1% |
| 7 | 1842 | 1.0% |
| 5 | 1836 | 1.0% |
| Other values (2743) | 149693 | |
| (Missing) | 9363 | 5.1% |
| Value | Count | Frequency (%) |
| 0 | 4 | < 0.1% |
| 1 | 2890 | |
| 2 | 2633 | |
| 3 | 1466 | |
| 4 | 1401 | |
| 5 | 1836 | |
| 6 | 2339 | |
| 7 | 1842 | |
| 8 | 2061 | |
| 9 | 2315 |
| Value | Count | Frequency (%) |
| 5341 | 1 | < 0.1% |
| 5337 | 5 | |
| 5335 | 1 | < 0.1% |
| 5330 | 1 | < 0.1% |
| 5321 | 1 | < 0.1% |
| 5318 | 1 | < 0.1% |
| 5314 | 1 | < 0.1% |
| 5313 | 1 | < 0.1% |
| 5309 | 1 | < 0.1% |
| 5305 | 1 | < 0.1% |
ST_NAME
Text
| Distinct | 4510 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 24 |
| Mean length | 10.551514 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1922929 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 397 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | PUTNAM ST |
|---|---|
| 2nd row | Lexington ST |
| 3rd row | Lexington ST |
| 4th row | Lexington ST |
| 5th row | Lexington ST |
| Value | Count | Frequency (%) |
| st | 123791 | |
| av | 25332 | 6.5% |
| rd | 16142 | 4.2% |
| e | 6110 | 1.6% |
| w | 5141 | 1.3% |
| commonwealth | 4994 | 1.3% |
| washington | 4250 | 1.1% |
| pl | 3482 | 0.9% |
| beacon | 3470 | 0.9% |
| hill | 3102 | 0.8% |
| Other values (3336) | 191555 |
Most occurring characters
| Value | Count | Frequency (%) |
| 205139 | 10.7% | |
| T | 189045 | 9.8% |
| S | 174821 | 9.1% |
| A | 103657 | 5.4% |
| E | 101553 | 5.3% |
| R | 90042 | 4.7% |
| O | 77275 | 4.0% |
| N | 73625 | 3.8% |
| L | 65593 | 3.4% |
| D | 51927 | 2.7% |
| Other values (47) | 790252 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1922929 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 205139 | 10.7% | |
| T | 189045 | 9.8% |
| S | 174821 | 9.1% |
| A | 103657 | 5.4% |
| E | 101553 | 5.3% |
| R | 90042 | 4.7% |
| O | 77275 | 4.0% |
| N | 73625 | 3.8% |
| L | 65593 | 3.4% |
| D | 51927 | 2.7% |
| Other values (47) | 790252 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1922929 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 205139 | 10.7% | |
| T | 189045 | 9.8% |
| S | 174821 | 9.1% |
| A | 103657 | 5.4% |
| E | 101553 | 5.3% |
| R | 90042 | 4.7% |
| O | 77275 | 4.0% |
| N | 73625 | 3.8% |
| L | 65593 | 3.4% |
| D | 51927 | 2.7% |
| Other values (47) | 790252 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1922929 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 205139 | 10.7% | |
| T | 189045 | 9.8% |
| S | 174821 | 9.1% |
| A | 103657 | 5.4% |
| E | 101553 | 5.3% |
| R | 90042 | 4.7% |
| O | 77275 | 4.0% |
| N | 73625 | 3.8% |
| L | 65593 | 3.4% |
| D | 51927 | 2.7% |
| Other values (47) | 790252 |
UNIT_NUM
Text
MISSING 
| Distinct | 14534 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 99629 |
| Missing (%) | 54.7% |
| Memory size | 1.4 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 18 |
| Mean length | 2.6008982 |
| Min length | 1 |
Characters and Unicode
| Total characters | 214868 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 10328 ? |
|---|---|
| Unique (%) | 12.5% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 3 |
| 4th row | 4 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 8656 | 10.2% |
| 2 | 8643 | 10.2% |
| 3 | 6435 | 7.6% |
| 4 | 2653 | 3.1% |
| 5 | 1740 | 2.1% |
| 6 | 1255 | 1.5% |
| ps | 1164 | 1.4% |
| 7 | 849 | 1.0% |
| 8 | 747 | 0.9% |
| 9 | 575 | 0.7% |
| Other values (13765) | 52103 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 36155 | |
| 2 | 27837 | |
| 3 | 20766 | |
| - | 19827 | |
| 0 | 18116 | |
| 4 | 14999 | 7.0% |
| 5 | 11888 | 5.5% |
| 6 | 9760 | 4.5% |
| 7 | 7460 | 3.5% |
| 8 | 6530 | 3.0% |
| Other values (47) | 41530 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 214868 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 36155 | |
| 2 | 27837 | |
| 3 | 20766 | |
| - | 19827 | |
| 0 | 18116 | |
| 4 | 14999 | 7.0% |
| 5 | 11888 | 5.5% |
| 6 | 9760 | 4.5% |
| 7 | 7460 | 3.5% |
| 8 | 6530 | 3.0% |
| Other values (47) | 41530 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 214868 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 36155 | |
| 2 | 27837 | |
| 3 | 20766 | |
| - | 19827 | |
| 0 | 18116 | |
| 4 | 14999 | 7.0% |
| 5 | 11888 | 5.5% |
| 6 | 9760 | 4.5% |
| 7 | 7460 | 3.5% |
| 8 | 6530 | 3.0% |
| Other values (47) | 41530 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 214868 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 36155 | |
| 2 | 27837 | |
| 3 | 20766 | |
| - | 19827 | |
| 0 | 18116 | |
| 4 | 14999 | 7.0% |
| 5 | 11888 | 5.5% |
| 6 | 9760 | 4.5% |
| 7 | 7460 | 3.5% |
| 8 | 6530 | 3.0% |
| Other values (47) | 41530 |
CITY
Categorical
HIGH CORRELATION 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
| BOSTON | |
|---|---|
| DORCHESTER | |
| SOUTH BOSTON | |
| JAMAICA PLAIN | |
| BRIGHTON | |
| Other values (14) |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 9.206937 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1677863 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | EAST BOSTON |
|---|---|
| 2nd row | EAST BOSTON |
| 3rd row | EAST BOSTON |
| 4th row | EAST BOSTON |
| 5th row | EAST BOSTON |
Common Values
| Value | Count | Frequency (%) |
| BOSTON | 47713 | |
| DORCHESTER | 29328 | |
| SOUTH BOSTON | 15622 | 8.6% |
| JAMAICA PLAIN | 12147 | 6.7% |
| BRIGHTON | 12113 | 6.6% |
| WEST ROXBURY | 11006 | 6.0% |
| EAST BOSTON | 10233 | 5.6% |
| ROSLINDALE | 9279 | 5.1% |
| HYDE PARK | 9192 | 5.0% |
| CHARLESTOWN | 7252 | 4.0% |
| Other values (9) | 18354 | 10.1% |
Length
| Value | Count | Frequency (%) |
| boston | 73568 | |
| dorchester | 29328 | 12.1% |
| roxbury | 18991 | 7.8% |
| south | 15622 | 6.4% |
| jamaica | 12147 | 5.0% |
| plain | 12147 | 5.0% |
| brighton | 12113 | 5.0% |
| west | 11006 | 4.5% |
| east | 10233 | 4.2% |
| roslindale | 9279 | 3.8% |
| Other values (12) | 38868 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 246073 | |
| T | 175338 | |
| S | 165454 | |
| R | 136346 | 8.1% |
| N | 126567 | 7.5% |
| E | 106670 | 6.4% |
| B | 104696 | 6.2% |
| A | 103595 | 6.2% |
| H | 75547 | 4.5% |
| 61063 | 3.6% | |
| Other values (14) | 376514 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1677863 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| O | 246073 | |
| T | 175338 | |
| S | 165454 | |
| R | 136346 | 8.1% |
| N | 126567 | 7.5% |
| E | 106670 | 6.4% |
| B | 104696 | 6.2% |
| A | 103595 | 6.2% |
| H | 75547 | 4.5% |
| 61063 | 3.6% | |
| Other values (14) | 376514 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1677863 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| O | 246073 | |
| T | 175338 | |
| S | 165454 | |
| R | 136346 | 8.1% |
| N | 126567 | 7.5% |
| E | 106670 | 6.4% |
| B | 104696 | 6.2% |
| A | 103595 | 6.2% |
| H | 75547 | 4.5% |
| 61063 | 3.6% | |
| Other values (14) | 376514 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1677863 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| O | 246073 | |
| T | 175338 | |
| S | 165454 | |
| R | 136346 | 8.1% |
| N | 126567 | 7.5% |
| E | 106670 | 6.4% |
| B | 104696 | 6.2% |
| A | 103595 | 6.2% |
| H | 75547 | 4.5% |
| 61063 | 3.6% | |
| Other values (14) | 376514 |
ZIP_CODE
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 37 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2129.8679 |
| Minimum | 2026 |
|---|---|
| Maximum | 2467 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 2026 |
|---|---|
| 5-th percentile | 2111 |
| Q1 | 2119 |
| median | 2127 |
| Q3 | 2131 |
| 95-th percentile | 2136 |
| Maximum | 2467 |
| Range | 441 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 30.721915 |
|---|---|
| Coefficient of variation (CV) | 0.014424328 |
| Kurtosis | 81.218517 |
| Mean | 2129.8679 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 8.1238643 |
| Sum | 3.88145 × 108 |
| Variance | 943.83603 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2127 | 15656 | 8.6% |
| 2130 | 12154 | 6.7% |
| 2135 | 12114 | 6.6% |
| 2124 | 11124 | 6.1% |
| 2132 | 11004 | 6.0% |
| 2128 | 10231 | 5.6% |
| 2116 | 9649 | 5.3% |
| 2118 | 9384 | 5.1% |
| 2131 | 9283 | 5.1% |
| 2136 | 9192 | 5.0% |
| Other values (27) | 72448 |
| Value | Count | Frequency (%) |
| 2026 | 6 | < 0.1% |
| 2108 | 2172 | 1.2% |
| 2109 | 1847 | 1.0% |
| 2110 | 2487 | 1.4% |
| 2111 | 2893 | 1.6% |
| 2113 | 2357 | 1.3% |
| 2114 | 5352 | |
| 2115 | 5548 | |
| 2116 | 9649 | |
| 2118 | 9384 |
| Value | Count | Frequency (%) |
| 2467 | 1017 | 0.6% |
| 2458 | 1 | < 0.1% |
| 2446 | 11 | < 0.1% |
| 2445 | 13 | < 0.1% |
| 2219 | 1 | < 0.1% |
| 2215 | 3649 | |
| 2210 | 2132 | |
| 2201 | 3 | < 0.1% |
| 2199 | 36 | < 0.1% |
| 2137 | 2 | < 0.1% |
BLDG_SEQ
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 1 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 182242 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 182242 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 182242 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 182242 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 182242 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 182242 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 182242 |
NUM_BLDGS
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 1 | |
|---|---|
| 2 | 14 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 182242 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 182228 | |
| 2 | 14 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 182228 | |
| 2 | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 182228 | |
| 2 | 14 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 182228 | |
| 2 | 14 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 182228 | |
| 2 | 14 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 182228 | |
| 2 | 14 | < 0.1% |
LUC
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 201 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 202.58164 |
| Minimum | 13 |
|---|---|
| Maximum | 995 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 101 |
| Q1 | 102 |
| median | 102 |
| Q3 | 108 |
| 95-th percentile | 995 |
| Maximum | 995 |
| Range | 982 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 266.13109 |
|---|---|
| Coefficient of variation (CV) | 1.3136979 |
| Kurtosis | 4.3641341 |
| Mean | 202.58164 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.4745422 |
| Sum | 36918884 |
| Variance | 70825.757 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 102 | 71974 | |
| 101 | 30439 | |
| 104 | 16814 | 9.2% |
| 105 | 13298 | 7.3% |
| 995 | 10768 | 5.9% |
| 108 | 8451 | 4.6% |
| 132 | 4185 | 2.3% |
| 111 | 2496 | 1.4% |
| 13 | 2270 | 1.2% |
| 985 | 2185 | 1.2% |
| Other values (191) | 19362 | 10.6% |
| Value | Count | Frequency (%) |
| 13 | 2270 | 1.2% |
| 31 | 665 | 0.4% |
| 101 | 30439 | |
| 102 | 71974 | |
| 103 | 2 | < 0.1% |
| 104 | 16814 | 9.2% |
| 105 | 13298 | 7.3% |
| 106 | 786 | 0.4% |
| 108 | 8451 | 4.6% |
| 109 | 170 | 0.1% |
| Value | Count | Frequency (%) |
| 995 | 10768 | |
| 992 | 10 | < 0.1% |
| 991 | 14 | < 0.1% |
| 990 | 5 | < 0.1% |
| 988 | 2 | < 0.1% |
| 987 | 59 | < 0.1% |
| 986 | 817 | 0.4% |
| 985 | 2185 | 1.2% |
| 983 | 16 | < 0.1% |
| 982 | 1 | < 0.1% |
LU
Categorical
HIGH CORRELATION 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| CD | |
|---|---|
| R1 | |
| R2 | |
| R3 | |
| CM | |
| Other values (12) |
Length
| Max length | 7 |
|---|---|
| Median length | 2 |
| Mean length | 2.0794548 |
| Min length | 1 |
Characters and Unicode
| Total characters | 378964 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | R3 |
|---|---|
| 2nd row | R3 |
| 3rd row | R3 |
| 4th row | R3 |
| 5th row | R2 |
Common Values
| Value | Count | Frequency (%) |
| CD | 71988 | |
| R1 | 30441 | |
| R2 | 16814 | 9.2% |
| R3 | 13468 | 7.4% |
| CM | 10768 | 5.9% |
| CP | 8451 | 4.6% |
| E | 7610 | 4.2% |
| RL - RL | 6030 | 3.3% |
| C | 4658 | 2.6% |
| A | 2964 | 1.6% |
| Other values (7) | 9050 | 5.0% |
Length
| Value | Count | Frequency (%) |
| cd | 71988 | |
| r1 | 30441 | |
| r2 | 16814 | 8.7% |
| r3 | 13468 | 6.9% |
| rl | 12060 | 6.2% |
| cm | 10768 | 5.5% |
| cp | 8451 | 4.3% |
| e | 7610 | 3.9% |
| 6030 | 3.1% | |
| c | 4658 | 2.4% |
| Other values (8) | 12014 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 103296 | |
| R | 78214 | |
| D | 71988 | |
| 1 | 30441 | 8.0% |
| 2 | 16814 | 4.4% |
| 3 | 13468 | 3.6% |
| L | 13446 | 3.5% |
| 12060 | 3.2% | |
| M | 10768 | 2.8% |
| P | 8451 | 2.2% |
| Other values (6) | 20018 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 378964 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| C | 103296 | |
| R | 78214 | |
| D | 71988 | |
| 1 | 30441 | 8.0% |
| 2 | 16814 | 4.4% |
| 3 | 13468 | 3.6% |
| L | 13446 | 3.5% |
| 12060 | 3.2% | |
| M | 10768 | 2.8% |
| P | 8451 | 2.2% |
| Other values (6) | 20018 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 378964 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| C | 103296 | |
| R | 78214 | |
| D | 71988 | |
| 1 | 30441 | 8.0% |
| 2 | 16814 | 4.4% |
| 3 | 13468 | 3.6% |
| L | 13446 | 3.5% |
| 12060 | 3.2% | |
| M | 10768 | 2.8% |
| P | 8451 | 2.2% |
| Other values (6) | 20018 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 378964 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| C | 103296 | |
| R | 78214 | |
| D | 71988 | |
| 1 | 30441 | 8.0% |
| 2 | 16814 | 4.4% |
| 3 | 13468 | 3.6% |
| L | 13446 | 3.5% |
| 12060 | 3.2% | |
| M | 10768 | 2.8% |
| P | 8451 | 2.2% |
| Other values (6) | 20018 | 5.3% |
LU_DESC
Text
| Distinct | 194 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 26 |
| Mean length | 16.888231 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3077745 |
|---|---|
| Distinct characters | 68 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 16 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | THREE-FAM DWELLING |
|---|---|
| 2nd row | THREE-FAM DWELLING |
| 3rd row | THREE-FAM DWELLING |
| 4th row | THREE-FAM DWELLING |
| 5th row | TWO-FAM DWELLING |
| Value | Count | Frequency (%) |
| condo | 92825 | |
| residential | 72979 | |
| dwelling | 60551 | |
| single | 30439 | 7.2% |
| fam | 30439 | 7.2% |
| two-fam | 16814 | 4.0% |
| res | 15805 | 3.7% |
| three-fam | 13298 | 3.1% |
| main | 10768 | 2.5% |
| parking | 8987 | 2.1% |
| Other values (264) | 71041 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 306547 | 10.0% |
| N | 293643 | 9.5% |
| I | 277658 | 9.0% |
| L | 247639 | 8.0% |
| 241900 | 7.9% | |
| D | 239004 | 7.8% |
| O | 225960 | 7.3% |
| A | 176683 | 5.7% |
| S | 140752 | 4.6% |
| T | 130272 | 4.2% |
| Other values (58) | 797687 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3077745 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| E | 306547 | 10.0% |
| N | 293643 | 9.5% |
| I | 277658 | 9.0% |
| L | 247639 | 8.0% |
| 241900 | 7.9% | |
| D | 239004 | 7.8% |
| O | 225960 | 7.3% |
| A | 176683 | 5.7% |
| S | 140752 | 4.6% |
| T | 130272 | 4.2% |
| Other values (58) | 797687 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3077745 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| E | 306547 | 10.0% |
| N | 293643 | 9.5% |
| I | 277658 | 9.0% |
| L | 247639 | 8.0% |
| 241900 | 7.9% | |
| D | 239004 | 7.8% |
| O | 225960 | 7.3% |
| A | 176683 | 5.7% |
| S | 140752 | 4.6% |
| T | 130272 | 4.2% |
| Other values (58) | 797687 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3077745 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| E | 306547 | 10.0% |
| N | 293643 | 9.5% |
| I | 277658 | 9.0% |
| L | 247639 | 8.0% |
| 241900 | 7.9% | |
| D | 239004 | 7.8% |
| O | 225960 | 7.3% |
| A | 176683 | 5.7% |
| S | 140752 | 4.6% |
| T | 130272 | 4.2% |
| Other values (58) | 797687 |
BLDG_TYPE
Text
MISSING 
| Distinct | 201 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2616 |
| Missing (%) | 1.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 31 |
| Mean length | 13.795586 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2478046 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | RE - Row End |
|---|---|
| 2nd row | RM - Row Middle |
| 3rd row | RM - Row Middle |
| 4th row | RM - Row Middle |
| 5th row | RE - Row End |
| Value | Count | Frequency (%) |
| 168505 | ||
| rise | 38625 | 6.1% |
| row | 26473 | 4.2% |
| rm | 17763 | 2.8% |
| middle | 17698 | 2.8% |
| cl | 16789 | 2.6% |
| colonial | 16789 | 2.6% |
| lr | 15448 | 2.4% |
| low | 15448 | 2.4% |
| mr | 15194 | 2.4% |
| Other values (462) | 285650 |
Most occurring characters
| Value | Count | Frequency (%) |
| 454951 | ||
| - | 179927 | 7.3% |
| R | 147705 | 6.0% |
| e | 145112 | 5.9% |
| i | 129604 | 5.2% |
| o | 126568 | 5.1% |
| n | 106689 | 4.3% |
| d | 85062 | 3.4% |
| a | 82052 | 3.3% |
| l | 79352 | 3.2% |
| Other values (59) | 941024 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2478046 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 454951 | ||
| - | 179927 | 7.3% |
| R | 147705 | 6.0% |
| e | 145112 | 5.9% |
| i | 129604 | 5.2% |
| o | 126568 | 5.1% |
| n | 106689 | 4.3% |
| d | 85062 | 3.4% |
| a | 82052 | 3.3% |
| l | 79352 | 3.2% |
| Other values (59) | 941024 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2478046 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 454951 | ||
| - | 179927 | 7.3% |
| R | 147705 | 6.0% |
| e | 145112 | 5.9% |
| i | 129604 | 5.2% |
| o | 126568 | 5.1% |
| n | 106689 | 4.3% |
| d | 85062 | 3.4% |
| a | 82052 | 3.3% |
| l | 79352 | 3.2% |
| Other values (59) | 941024 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2478046 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 454951 | ||
| - | 179927 | 7.3% |
| R | 147705 | 6.0% |
| e | 145112 | 5.9% |
| i | 129604 | 5.2% |
| o | 126568 | 5.1% |
| n | 106689 | 4.3% |
| d | 85062 | 3.4% |
| a | 82052 | 3.3% |
| l | 79352 | 3.2% |
| Other values (59) | 941024 |
OWN_OCC
Boolean
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 178.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 104952 | |
| True | 77290 |
OWNER
Text
| Distinct | 143345 |
|---|---|
| Distinct (%) | 78.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 95 |
|---|---|
| Median length | 73 |
| Mean length | 18.269499 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3329470 |
|---|---|
| Distinct characters | 79 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 124603 ? |
|---|---|
| Unique (%) | 68.4% |
Sample
| 1st row | PASCUCCI CARLO |
|---|---|
| 2nd row | SEMBRANO RODERICK |
| 3rd row | GUERRA CHEVARRIA ANA S |
| 4th row | JB REALTY TRUST |
| 5th row | MARKS TRAVIS JOSEPH |
| Value | Count | Frequency (%) |
| llc | 22354 | 3.9% |
| trust | 18834 | 3.3% |
| street | 8131 | 1.4% |
| a | 7749 | 1.4% |
| m | 7240 | 1.3% |
| j | 6867 | 1.2% |
| realty | 6548 | 1.1% |
| of | 5074 | 0.9% |
| boston | 5042 | 0.9% |
| condo | 4882 | 0.9% |
| Other values (66663) | 478018 |
Most occurring characters
| Value | Count | Frequency (%) |
| 392298 | ||
| E | 287947 | 8.6% |
| A | 277585 | 8.3% |
| R | 227193 | 6.8% |
| N | 217593 | 6.5% |
| T | 207937 | 6.2% |
| L | 203316 | 6.1% |
| O | 190013 | 5.7% |
| I | 184146 | 5.5% |
| S | 179232 | 5.4% |
| Other values (69) | 962210 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3329470 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 392298 | ||
| E | 287947 | 8.6% |
| A | 277585 | 8.3% |
| R | 227193 | 6.8% |
| N | 217593 | 6.5% |
| T | 207937 | 6.2% |
| L | 203316 | 6.1% |
| O | 190013 | 5.7% |
| I | 184146 | 5.5% |
| S | 179232 | 5.4% |
| Other values (69) | 962210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3329470 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 392298 | ||
| E | 287947 | 8.6% |
| A | 277585 | 8.3% |
| R | 227193 | 6.8% |
| N | 217593 | 6.5% |
| T | 207937 | 6.2% |
| L | 203316 | 6.1% |
| O | 190013 | 5.7% |
| I | 184146 | 5.5% |
| S | 179232 | 5.4% |
| Other values (69) | 962210 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3329470 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 392298 | ||
| E | 287947 | 8.6% |
| A | 277585 | 8.3% |
| R | 227193 | 6.8% |
| N | 217593 | 6.5% |
| T | 207937 | 6.2% |
| L | 203316 | 6.1% |
| O | 190013 | 5.7% |
| I | 184146 | 5.5% |
| S | 179232 | 5.4% |
| Other values (69) | 962210 |
MAIL_ADDRESSEE
Text
MISSING 
| Distinct | 24033 |
|---|---|
| Distinct (%) | 69.8% |
| Missing | 147830 |
| Missing (%) | 81.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 84 |
|---|---|
| Median length | 63 |
| Mean length | 21.515082 |
| Min length | 7 |
Characters and Unicode
| Total characters | 740377 |
|---|---|
| Distinct characters | 60 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 20481 ? |
|---|---|
| Unique (%) | 59.5% |
Sample
| 1st row | C/O LAUREN SCHOENADEL |
|---|---|
| 2nd row | C/O ARTEM & CAITLYN SHKURATOV |
| 3rd row | C/O MILDRED CASIELLO |
| 4th row | C/O PETER LAPLANTE |
| 5th row | C/O NASSER FARD |
| Value | Count | Frequency (%) |
| c/o | 34398 | 25.3% |
| llc | 2892 | 2.1% |
| ts | 2188 | 1.6% |
| 1798 | 1.3% | |
| inc | 1314 | 1.0% |
| j | 1197 | 0.9% |
| management | 1068 | 0.8% |
| a | 1027 | 0.8% |
| m | 1011 | 0.7% |
| realty | 948 | 0.7% |
| Other values (18312) | 88013 |
Most occurring characters
| Value | Count | Frequency (%) |
| 102450 | ||
| O | 68138 | 9.2% |
| C | 57372 | 7.7% |
| E | 54226 | 7.3% |
| A | 53772 | 7.3% |
| N | 42498 | 5.7% |
| R | 42209 | 5.7% |
| / | 35062 | 4.7% |
| I | 34455 | 4.7% |
| L | 34441 | 4.7% |
| Other values (50) | 215754 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 740377 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 102450 | ||
| O | 68138 | 9.2% |
| C | 57372 | 7.7% |
| E | 54226 | 7.3% |
| A | 53772 | 7.3% |
| N | 42498 | 5.7% |
| R | 42209 | 5.7% |
| / | 35062 | 4.7% |
| I | 34455 | 4.7% |
| L | 34441 | 4.7% |
| Other values (50) | 215754 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 740377 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 102450 | ||
| O | 68138 | 9.2% |
| C | 57372 | 7.7% |
| E | 54226 | 7.3% |
| A | 53772 | 7.3% |
| N | 42498 | 5.7% |
| R | 42209 | 5.7% |
| / | 35062 | 4.7% |
| I | 34455 | 4.7% |
| L | 34441 | 4.7% |
| Other values (50) | 215754 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 740377 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 102450 | ||
| O | 68138 | 9.2% |
| C | 57372 | 7.7% |
| E | 54226 | 7.3% |
| A | 53772 | 7.3% |
| N | 42498 | 5.7% |
| R | 42209 | 5.7% |
| / | 35062 | 4.7% |
| I | 34455 | 4.7% |
| L | 34441 | 4.7% |
| Other values (50) | 215754 |
| Distinct | 142503 |
|---|---|
| Distinct (%) | 78.2% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 48 |
| Mean length | 16.857878 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3072146 |
|---|---|
| Distinct characters | 77 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 125351 ? |
|---|---|
| Unique (%) | 68.8% |
Sample
| 1st row | 195 LEXINGTON ST |
|---|---|
| 2nd row | 197 LEXINGTON ST |
| 3rd row | 199 LEXINGTON ST |
| 4th row | PO BOX 557 # |
| 5th row | 203 Lexington ST |
| Value | Count | Frequency (%) |
| st | 100012 | 14.7% |
| unit | 29533 | 4.4% |
| rd | 18882 | 2.8% |
| av | 11821 | 1.7% |
| 1 | 11615 | 1.7% |
| ave | 11160 | 1.6% |
| 2 | 10828 | 1.6% |
| 3 | 7692 | 1.1% |
| e | 4263 | 0.6% |
| 4 | 4116 | 0.6% |
| Other values (20381) | 468203 |
Most occurring characters
| Value | Count | Frequency (%) |
| 500810 | 16.3% | |
| T | 205716 | 6.7% |
| S | 174198 | 5.7% |
| E | 164084 | 5.3% |
| A | 142235 | 4.6% |
| R | 134766 | 4.4% |
| O | 117647 | 3.8% |
| 1 | 115240 | 3.8% |
| N | 113509 | 3.7% |
| L | 91393 | 3.0% |
| Other values (67) | 1312548 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3072146 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 500810 | 16.3% | |
| T | 205716 | 6.7% |
| S | 174198 | 5.7% |
| E | 164084 | 5.3% |
| A | 142235 | 4.6% |
| R | 134766 | 4.4% |
| O | 117647 | 3.8% |
| 1 | 115240 | 3.8% |
| N | 113509 | 3.7% |
| L | 91393 | 3.0% |
| Other values (67) | 1312548 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3072146 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 500810 | 16.3% | |
| T | 205716 | 6.7% |
| S | 174198 | 5.7% |
| E | 164084 | 5.3% |
| A | 142235 | 4.6% |
| R | 134766 | 4.4% |
| O | 117647 | 3.8% |
| 1 | 115240 | 3.8% |
| N | 113509 | 3.7% |
| L | 91393 | 3.0% |
| Other values (67) | 1312548 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3072146 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 500810 | 16.3% | |
| T | 205716 | 6.7% |
| S | 174198 | 5.7% |
| E | 164084 | 5.3% |
| A | 142235 | 4.6% |
| R | 134766 | 4.4% |
| O | 117647 | 3.8% |
| 1 | 115240 | 3.8% |
| N | 113509 | 3.7% |
| L | 91393 | 3.0% |
| Other values (67) | 1312548 |
MAIL_CITY
Text
| Distinct | 2394 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 30 |
| Mean length | 8.9950283 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1639173 |
|---|---|
| Distinct characters | 69 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1162 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | EAST BOSTON |
|---|---|
| 2nd row | EAST BOSTON |
| 3rd row | EAST BOSTON |
| 4th row | EVERETT |
| 5th row | EAST BOSTON |
| Value | Count | Frequency (%) |
| boston | 61826 | |
| dorchester | 24084 | 10.1% |
| roxbury | 15809 | 6.6% |
| south | 11238 | 4.7% |
| jamaica | 10649 | 4.5% |
| plain | 10648 | 4.5% |
| west | 10430 | 4.4% |
| roslindale | 8304 | 3.5% |
| park | 8152 | 3.4% |
| hyde | 8070 | 3.4% |
| Other values (2112) | 69163 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 224351 | |
| T | 158522 | 9.7% |
| S | 145589 | 8.9% |
| N | 129729 | 7.9% |
| R | 128219 | 7.8% |
| E | 118211 | 7.2% |
| A | 105399 | 6.4% |
| B | 92889 | 5.7% |
| H | 69147 | 4.2% |
| L | 58882 | 3.6% |
| Other values (59) | 408235 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1639173 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| O | 224351 | |
| T | 158522 | 9.7% |
| S | 145589 | 8.9% |
| N | 129729 | 7.9% |
| R | 128219 | 7.8% |
| E | 118211 | 7.2% |
| A | 105399 | 6.4% |
| B | 92889 | 5.7% |
| H | 69147 | 4.2% |
| L | 58882 | 3.6% |
| Other values (59) | 408235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1639173 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| O | 224351 | |
| T | 158522 | 9.7% |
| S | 145589 | 8.9% |
| N | 129729 | 7.9% |
| R | 128219 | 7.8% |
| E | 118211 | 7.2% |
| A | 105399 | 6.4% |
| B | 92889 | 5.7% |
| H | 69147 | 4.2% |
| L | 58882 | 3.6% |
| Other values (59) | 408235 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1639173 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| O | 224351 | |
| T | 158522 | 9.7% |
| S | 145589 | 8.9% |
| N | 129729 | 7.9% |
| R | 128219 | 7.8% |
| E | 118211 | 7.2% |
| A | 105399 | 6.4% |
| B | 92889 | 5.7% |
| H | 69147 | 4.2% |
| L | 58882 | 3.6% |
| Other values (59) | 408235 |
MAIL_STATE
Text
| Distinct | 67 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 319 |
| Missing (%) | 0.2% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 2 |
| Mean length | 2.0008355 |
| Min length | 1 |
Characters and Unicode
| Total characters | 363998 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | MA |
|---|---|
| 2nd row | MA |
| 3rd row | MA |
| 4th row | MA |
| 5th row | MA |
| Value | Count | Frequency (%) |
| ma | 173979 | |
| ny | 1250 | 0.7% |
| fl | 1172 | 0.6% |
| ca | 959 | 0.5% |
| nh | 702 | 0.4% |
| tx | 544 | 0.3% |
| ct | 473 | 0.3% |
| nj | 284 | 0.2% |
| ri | 280 | 0.2% |
| me | 203 | 0.1% |
| Other values (61) | 2086 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 175610 | |
| M | 174515 | |
| N | 2568 | 0.7% |
| C | 1824 | 0.5% |
| L | 1338 | 0.4% |
| Y | 1273 | 0.3% |
| F | 1176 | 0.3% |
| T | 1169 | 0.3% |
| H | 852 | 0.2% |
| I | 572 | 0.2% |
| Other values (30) | 3101 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 363998 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 175610 | |
| M | 174515 | |
| N | 2568 | 0.7% |
| C | 1824 | 0.5% |
| L | 1338 | 0.4% |
| Y | 1273 | 0.3% |
| F | 1176 | 0.3% |
| T | 1169 | 0.3% |
| H | 852 | 0.2% |
| I | 572 | 0.2% |
| Other values (30) | 3101 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 363998 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 175610 | |
| M | 174515 | |
| N | 2568 | 0.7% |
| C | 1824 | 0.5% |
| L | 1338 | 0.4% |
| Y | 1273 | 0.3% |
| F | 1176 | 0.3% |
| T | 1169 | 0.3% |
| H | 852 | 0.2% |
| I | 572 | 0.2% |
| Other values (30) | 3101 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 363998 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 175610 | |
| M | 174515 | |
| N | 2568 | 0.7% |
| C | 1824 | 0.5% |
| L | 1338 | 0.4% |
| Y | 1273 | 0.3% |
| F | 1176 | 0.3% |
| T | 1169 | 0.3% |
| H | 852 | 0.2% |
| I | 572 | 0.2% |
| Other values (30) | 3101 | 0.9% |
MAIL_ZIP_CODE
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 61 |
|---|---|
| Missing (%) | < 0.1% |
| Memory size | 1.4 MiB |
RES_FLOOR
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 48 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 33792 |
| Missing (%) | 18.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8803537 |
| Minimum | 0 |
|---|---|
| Maximum | 62 |
| Zeros | 31 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2.5 |
| 95-th percentile | 3 |
| Maximum | 62 |
| Range | 62 |
| Interquartile range (IQR) | 1.5 |
Descriptive statistics
| Standard deviation | 1.1290417 |
|---|---|
| Coefficient of variation (CV) | 0.60044113 |
| Kurtosis | 321.52964 |
| Mean | 1.8803537 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 9.6638775 |
| Sum | 279138.5 |
| Variance | 1.2747351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 63070 | |
| 2 | 42412 | |
| 3 | 24604 | 13.5% |
| 2.5 | 7712 | 4.2% |
| 4 | 4616 | 2.5% |
| 1.5 | 3505 | 1.9% |
| 5 | 1360 | 0.7% |
| 3.5 | 487 | 0.3% |
| 6 | 237 | 0.1% |
| 4.5 | 105 | 0.1% |
| Other values (38) | 342 | 0.2% |
| (Missing) | 33792 |
| Value | Count | Frequency (%) |
| 0 | 31 | < 0.1% |
| 1 | 63070 | |
| 1.5 | 3505 | 1.9% |
| 2 | 42412 | |
| 2.5 | 7712 | 4.2% |
| 3 | 24604 | 13.5% |
| 3.5 | 487 | 0.3% |
| 4 | 4616 | 2.5% |
| 4.5 | 105 | 0.1% |
| 5 | 1360 | 0.7% |
| Value | Count | Frequency (%) |
| 62 | 1 | < 0.1% |
| 60 | 2 | |
| 46 | 3 | |
| 45 | 1 | < 0.1% |
| 41 | 2 | |
| 40 | 1 | < 0.1% |
| 39 | 2 | |
| 36 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
CD_FLOOR
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 60 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 110270 |
| Missing (%) | 60.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5203135 |
| Minimum | 0 |
|---|---|
| Maximum | 60 |
| Zeros | 8077 |
| Zeros (%) | 4.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 12 |
| Maximum | 60 |
| Range | 60 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 5.2569276 |
|---|---|
| Coefficient of variation (CV) | 1.4933124 |
| Kurtosis | 28.761902 |
| Mean | 3.5203135 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.6866222 |
| Sum | 253364 |
| Variance | 27.635288 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 16764 | 9.2% |
| 1 | 14700 | 8.1% |
| 3 | 13735 | 7.5% |
| 0 | 8077 | 4.4% |
| 4 | 6543 | 3.6% |
| 5 | 3448 | 1.9% |
| 6 | 1762 | 1.0% |
| 7 | 965 | 0.5% |
| 8 | 707 | 0.4% |
| 9 | 597 | 0.3% |
| Other values (50) | 4674 | 2.6% |
| (Missing) | 110270 |
| Value | Count | Frequency (%) |
| 0 | 8077 | |
| 1 | 14700 | |
| 2 | 16764 | |
| 3 | 13735 | |
| 4 | 6543 | 3.6% |
| 5 | 3448 | 1.9% |
| 6 | 1762 | 1.0% |
| 7 | 965 | 0.5% |
| 8 | 707 | 0.4% |
| 9 | 597 | 0.3% |
| Value | Count | Frequency (%) |
| 60 | 4 | < 0.1% |
| 59 | 6 | |
| 58 | 7 | |
| 57 | 7 | |
| 56 | 9 | |
| 55 | 10 | |
| 54 | 10 | |
| 53 | 10 | |
| 52 | 10 | |
| 51 | 10 |
RES_UNITS
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 149 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 171474 |
| Missing (%) | 94.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.6886144 |
| Minimum | 0 |
|---|---|
| Maximum | 477 |
| Zeros | 218 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 20 |
| Maximum | 477 |
| Range | 477 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 18.17673 |
|---|---|
| Coefficient of variation (CV) | 2.7175628 |
| Kurtosis | 209.23304 |
| Mean | 6.6886144 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 11.97815 |
| Sum | 72023 |
| Variance | 330.39351 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3962 | 2.2% |
| 2 | 2784 | 1.5% |
| 4 | 1057 | 0.6% |
| 5 | 526 | 0.3% |
| 6 | 519 | 0.3% |
| 8 | 223 | 0.1% |
| 0 | 218 | 0.1% |
| 9 | 194 | 0.1% |
| 7 | 149 | 0.1% |
| 10 | 136 | 0.1% |
| Other values (139) | 1000 | 0.5% |
| (Missing) | 171474 |
| Value | Count | Frequency (%) |
| 0 | 218 | 0.1% |
| 1 | 29 | < 0.1% |
| 2 | 2784 | |
| 3 | 3962 | |
| 4 | 1057 | 0.6% |
| 5 | 526 | 0.3% |
| 6 | 519 | 0.3% |
| 7 | 149 | 0.1% |
| 8 | 223 | 0.1% |
| 9 | 194 | 0.1% |
| Value | Count | Frequency (%) |
| 477 | 1 | |
| 463 | 1 | |
| 442 | 1 | |
| 372 | 1 | |
| 367 | 1 | |
| 354 | 1 | |
| 338 | 1 | |
| 312 | 1 | |
| 311 | 1 | |
| 271 | 1 |
COM_UNITS
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 24 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 171474 |
| Missing (%) | 94.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.17737741 |
| Minimum | 0 |
|---|---|
| Maximum | 212 |
| Zeros | 10131 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 212 |
| Range | 212 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.6274729 |
|---|---|
| Coefficient of variation (CV) | 14.812894 |
| Kurtosis | 4459.5512 |
| Mean | 0.17737741 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 61.028972 |
| Sum | 1910 |
| Variance | 6.9036137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 10131 | 5.6% |
| 1 | 350 | 0.2% |
| 2 | 134 | 0.1% |
| 3 | 58 | < 0.1% |
| 4 | 24 | < 0.1% |
| 5 | 18 | < 0.1% |
| 6 | 10 | < 0.1% |
| 8 | 9 | < 0.1% |
| 7 | 8 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (14) | 19 | < 0.1% |
| (Missing) | 171474 |
| Value | Count | Frequency (%) |
| 0 | 10131 | |
| 1 | 350 | 0.2% |
| 2 | 134 | 0.1% |
| 3 | 58 | < 0.1% |
| 4 | 24 | < 0.1% |
| 5 | 18 | < 0.1% |
| 6 | 10 | < 0.1% |
| 7 | 8 | < 0.1% |
| 8 | 9 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 212 | 1 | |
| 127 | 1 | |
| 60 | 1 | |
| 38 | 1 | |
| 26 | 2 | |
| 23 | 1 | |
| 21 | 1 | |
| 20 | 1 | |
| 18 | 1 | |
| 15 | 1 |
RC_UNITS
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 171474 |
| Missing (%) | 94.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.010958395 |
| Minimum | 0 |
|---|---|
| Maximum | 29 |
| Zeros | 10731 |
| Zeros (%) | 5.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 29 |
| Range | 29 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.39999848 |
|---|---|
| Coefficient of variation (CV) | 36.501556 |
| Kurtosis | 3907.048 |
| Mean | 0.010958395 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 59.884204 |
| Sum | 118 |
| Variance | 0.15999878 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 10731 | 5.9% |
| 1 | 24 | < 0.1% |
| 2 | 7 | < 0.1% |
| 5 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| (Missing) | 171474 |
| Value | Count | Frequency (%) |
| 0 | 10731 | |
| 1 | 24 | < 0.1% |
| 2 | 7 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 29 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 14 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 2 | 7 | < 0.1% |
| 1 | 24 | < 0.1% |
| 0 | 10731 |
LAND_SF
Text
MISSING 
| Distinct | 17559 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 8002 |
| Missing (%) | 4.4% |
| Memory size | 1.4 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 4.5646235 |
| Min length | 3 |
Characters and Unicode
| Total characters | 795340 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7616 ? |
|---|---|
| Unique (%) | 4.4% |
Sample
| 1st row | 1,150 |
|---|---|
| 2nd row | 1,150 |
| 3rd row | 1,150 |
| 4th row | 1,150 |
| 5th row | 2,010 |
| Value | Count | Frequency (%) |
| 5,000 | 2392 | 1.4% |
| 4,000 | 1353 | 0.8% |
| 2,500 | 862 | 0.5% |
| 6,000 | 858 | 0.5% |
| 4,500 | 671 | 0.4% |
| 5,500 | 641 | 0.4% |
| 3,600 | 536 | 0.3% |
| 3,200 | 492 | 0.3% |
| 3,000 | 490 | 0.3% |
| 2,000 | 347 | 0.2% |
| Other values (17549) | 165598 |
Most occurring characters
| Value | Count | Frequency (%) |
| , | 130489 | |
| 0 | 106022 | |
| 1 | 93910 | |
| 5 | 73860 | |
| 2 | 66439 | |
| 4 | 60552 | |
| 3 | 59000 | |
| 6 | 57057 | |
| 7 | 52775 | |
| 8 | 50775 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 795340 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| , | 130489 | |
| 0 | 106022 | |
| 1 | 93910 | |
| 5 | 73860 | |
| 2 | 66439 | |
| 4 | 60552 | |
| 3 | 59000 | |
| 6 | 57057 | |
| 7 | 52775 | |
| 8 | 50775 | 6.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 795340 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| , | 130489 | |
| 0 | 106022 | |
| 1 | 93910 | |
| 5 | 73860 | |
| 2 | 66439 | |
| 4 | 60552 | |
| 3 | 59000 | |
| 6 | 57057 | |
| 7 | 52775 | |
| 8 | 50775 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 795340 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| , | 130489 | |
| 0 | 106022 | |
| 1 | 93910 | |
| 5 | 73860 | |
| 2 | 66439 | |
| 4 | 60552 | |
| 3 | 59000 | |
| 6 | 57057 | |
| 7 | 52775 | |
| 8 | 50775 | 6.4% |
GROSS_AREA
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 13098 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 33848 |
| Missing (%) | 18.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5434.6768 |
| Minimum | 3 |
|---|---|
| Maximum | 6982322 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 544 |
| Q1 | 967 |
| median | 2085 |
| Q3 | 4008 |
| 95-th percentile | 7770.7 |
| Maximum | 6982322 |
| Range | 6982319 |
| Interquartile range (IQR) | 3041 |
Descriptive statistics
| Standard deviation | 41322.818 |
|---|---|
| Coefficient of variation (CV) | 7.6035465 |
| Kurtosis | 6477.9546 |
| Mean | 5434.6768 |
| Median Absolute Deviation (MAD) | 1271 |
| Skewness | 55.895966 |
| Sum | 8.0647342 × 108 |
| Variance | 1.7075753 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 780 | 282 | 0.2% |
| 600 | 237 | 0.1% |
| 700 | 230 | 0.1% |
| 625 | 220 | 0.1% |
| 690 | 214 | 0.1% |
| 800 | 213 | 0.1% |
| 1050 | 211 | 0.1% |
| 760 | 210 | 0.1% |
| 775 | 203 | 0.1% |
| 730 | 197 | 0.1% |
| Other values (13088) | 146177 | |
| (Missing) | 33848 | 18.6% |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 100 | 125 | |
| 102 | 1 | < 0.1% |
| 106 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6982322 | 1 | |
| 3064910 | 1 | |
| 2948448 | 1 | |
| 2481232 | 1 | |
| 2310322 | 1 | |
| 1976650 | 1 | |
| 1970176 | 1 | |
| 1933059 | 1 | |
| 1772572 | 1 | |
| 1726152 | 1 |
LIVING_AREA
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 21808 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 34141 |
| Missing (%) | 18.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4437.7612 |
| Minimum | 2 |
|---|---|
| Maximum | 6982322 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 542 |
| Q1 | 942 |
| median | 1483.5 |
| Q3 | 2600 |
| 95-th percentile | 5768 |
| Maximum | 6982322 |
| Range | 6982320 |
| Interquartile range (IQR) | 1658 |
Descriptive statistics
| Standard deviation | 38453.214 |
|---|---|
| Coefficient of variation (CV) | 8.665003 |
| Kurtosis | 8373.4206 |
| Mean | 4437.7612 |
| Median Absolute Deviation (MAD) | 689.5 |
| Skewness | 63.652416 |
| Sum | 6.5723688 × 108 |
| Variance | 1.4786497 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 780 | 291 | 0.2% |
| 800 | 263 | 0.1% |
| 1008 | 261 | 0.1% |
| 1050 | 250 | 0.1% |
| 1224 | 243 | 0.1% |
| 600 | 239 | 0.1% |
| 700 | 235 | 0.1% |
| 625 | 223 | 0.1% |
| 1000 | 222 | 0.1% |
| 960 | 221 | 0.1% |
| Other values (21798) | 145653 | |
| (Missing) | 34141 | 18.7% |
| Value | Count | Frequency (%) |
| 2 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 90 | 1 | < 0.1% |
| 100 | 122 | |
| 102 | 1 | < 0.1% |
| 106 | 1 | < 0.1% |
| 108 | 1 | < 0.1% |
| 112 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 6982322 | 1 | |
| 2898078 | 1 | |
| 2882794 | 1 | |
| 2413114 | 1 | |
| 2310322 | 1 | |
| 1940476 | 1 | |
| 1885420 | 1 | |
| 1694084 | 1 | |
| 1595056 | 1 | |
| 1504200 | 1 |
LAND_VALUE
Text
| Distinct | 16658 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 1 |
| Mean length | 3.909697 |
| Min length | 1 |
Characters and Unicode
| Total characters | 712511 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9985 ? |
|---|---|
| Unique (%) | 5.5% |
Sample
| 1st row | 197,600 |
|---|---|
| 2nd row | 198,500 |
| 3rd row | 199,100 |
| 4th row | 199,700 |
| 5th row | 230,200 |
| Value | Count | Frequency (%) |
| 0 | 94289 | |
| 218,300 | 66 | < 0.1% |
| 203,200 | 63 | < 0.1% |
| 238,200 | 59 | < 0.1% |
| 250,000 | 58 | < 0.1% |
| 233,500 | 57 | < 0.1% |
| 239,800 | 57 | < 0.1% |
| 240,900 | 57 | < 0.1% |
| 218,500 | 57 | < 0.1% |
| 229,900 | 57 | < 0.1% |
| Other values (16648) | 87422 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 288645 | |
| , | 94143 | 13.2% |
| 2 | 63438 | 8.9% |
| 1 | 51469 | 7.2% |
| 3 | 39788 | 5.6% |
| 4 | 32613 | 4.6% |
| 5 | 30068 | 4.2% |
| 6 | 28788 | 4.0% |
| 7 | 28041 | 3.9% |
| 8 | 27791 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 712511 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 288645 | |
| , | 94143 | 13.2% |
| 2 | 63438 | 8.9% |
| 1 | 51469 | 7.2% |
| 3 | 39788 | 5.6% |
| 4 | 32613 | 4.6% |
| 5 | 30068 | 4.2% |
| 6 | 28788 | 4.0% |
| 7 | 28041 | 3.9% |
| 8 | 27791 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 712511 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 288645 | |
| , | 94143 | 13.2% |
| 2 | 63438 | 8.9% |
| 1 | 51469 | 7.2% |
| 3 | 39788 | 5.6% |
| 4 | 32613 | 4.6% |
| 5 | 30068 | 4.2% |
| 6 | 28788 | 4.0% |
| 7 | 28041 | 3.9% |
| 8 | 27791 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 712511 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 288645 | |
| , | 94143 | 13.2% |
| 2 | 63438 | 8.9% |
| 1 | 51469 | 7.2% |
| 3 | 39788 | 5.6% |
| 4 | 32613 | 4.6% |
| 5 | 30068 | 4.2% |
| 6 | 28788 | 4.0% |
| 7 | 28041 | 3.9% |
| 8 | 27791 | 3.9% |
BLDG_VALUE
Text
| Distinct | 28352 |
|---|---|
| Distinct (%) | 15.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.5419278 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1192214 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 14035 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | 594,400 |
|---|---|
| 2nd row | 619,700 |
| 3rd row | 605,300 |
| 4th row | 535,600 |
| 5th row | 501,400 |
| Value | Count | Frequency (%) |
| 0 | 20018 | 11.0% |
| 200 | 2213 | 1.2% |
| 60,000 | 674 | 0.4% |
| 40,000 | 565 | 0.3% |
| 43,000 | 481 | 0.3% |
| 90,000 | 342 | 0.2% |
| 38,000 | 308 | 0.2% |
| 74,600 | 305 | 0.2% |
| 48,000 | 280 | 0.2% |
| 108,000 | 275 | 0.2% |
| Other values (28342) | 156781 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 396479 | |
| , | 187305 | |
| 4 | 77028 | 6.5% |
| 3 | 74480 | 6.2% |
| 1 | 74375 | 6.2% |
| 5 | 71824 | 6.0% |
| 6 | 67827 | 5.7% |
| 2 | 67112 | 5.6% |
| 7 | 62524 | 5.2% |
| 8 | 58227 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1192214 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 396479 | |
| , | 187305 | |
| 4 | 77028 | 6.5% |
| 3 | 74480 | 6.2% |
| 1 | 74375 | 6.2% |
| 5 | 71824 | 6.0% |
| 6 | 67827 | 5.7% |
| 2 | 67112 | 5.6% |
| 7 | 62524 | 5.2% |
| 8 | 58227 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1192214 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 396479 | |
| , | 187305 | |
| 4 | 77028 | 6.5% |
| 3 | 74480 | 6.2% |
| 1 | 74375 | 6.2% |
| 5 | 71824 | 6.0% |
| 6 | 67827 | 5.7% |
| 2 | 67112 | 5.6% |
| 7 | 62524 | 5.2% |
| 8 | 58227 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1192214 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 396479 | |
| , | 187305 | |
| 4 | 77028 | 6.5% |
| 3 | 74480 | 6.2% |
| 1 | 74375 | 6.2% |
| 5 | 71824 | 6.0% |
| 6 | 67827 | 5.7% |
| 2 | 67112 | 5.6% |
| 7 | 62524 | 5.2% |
| 8 | 58227 | 4.9% |
SFYI_VALUE
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
| 0 |
|---|
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 182242 |
|---|---|
| Distinct characters | 1 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 182242 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 182242 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 182242 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 182242 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 182242 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 182242 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 182242 |
TOTAL_VALUE
Text
| Distinct | 32201 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 7.0112323 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1277741 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 15263 ? |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | 792,000 |
|---|---|
| 2nd row | 818,200 |
| 3rd row | 804,400 |
| 4th row | 735,300 |
| 5th row | 731,600 |
| Value | Count | Frequency (%) |
| 0 | 10774 | 5.9% |
| 60,000 | 681 | 0.4% |
| 40,000 | 580 | 0.3% |
| 43,000 | 491 | 0.3% |
| 90,000 | 345 | 0.2% |
| 38,000 | 318 | 0.2% |
| 74,600 | 307 | 0.2% |
| 48,000 | 293 | 0.2% |
| 108,000 | 277 | 0.2% |
| 47,000 | 272 | 0.1% |
| Other values (32191) | 167904 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 413329 | |
| , | 212603 | |
| 1 | 90965 | 7.1% |
| 4 | 76178 | 6.0% |
| 5 | 74929 | 5.9% |
| 6 | 73504 | 5.8% |
| 3 | 71845 | 5.6% |
| 7 | 69909 | 5.5% |
| 2 | 67851 | 5.3% |
| 8 | 64956 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1277741 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 413329 | |
| , | 212603 | |
| 1 | 90965 | 7.1% |
| 4 | 76178 | 6.0% |
| 5 | 74929 | 5.9% |
| 6 | 73504 | 5.8% |
| 3 | 71845 | 5.6% |
| 7 | 69909 | 5.5% |
| 2 | 67851 | 5.3% |
| 8 | 64956 | 5.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1277741 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 413329 | |
| , | 212603 | |
| 1 | 90965 | 7.1% |
| 4 | 76178 | 6.0% |
| 5 | 74929 | 5.9% |
| 6 | 73504 | 5.8% |
| 3 | 71845 | 5.6% |
| 7 | 69909 | 5.5% |
| 2 | 67851 | 5.3% |
| 8 | 64956 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1277741 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 413329 | |
| , | 212603 | |
| 1 | 90965 | 7.1% |
| 4 | 76178 | 6.0% |
| 5 | 74929 | 5.9% |
| 6 | 73504 | 5.8% |
| 3 | 71845 | 5.6% |
| 7 | 69909 | 5.5% |
| 2 | 67851 | 5.3% |
| 8 | 64956 | 5.1% |
GROSS_TAX
Text
| Distinct | 34946 |
|---|---|
| Distinct (%) | 19.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 10.613882 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1934295 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18669 ? |
|---|---|
| Unique (%) | 10.2% |
Sample
| 1st row | $8,632.80 |
|---|---|
| 2nd row | $8,918.38 |
| 3rd row | $8,767.96 |
| 4th row | $8,014.77 |
| 5th row | $7,974.44 |
| Value | Count | Frequency (%) |
| 18617 | 10.2% | |
| 654.00 | 676 | 0.4% |
| 436.00 | 577 | 0.3% |
| 468.70 | 441 | 0.2% |
| 981.00 | 343 | 0.2% |
| 813.14 | 305 | 0.2% |
| 523.20 | 290 | 0.2% |
| 414.20 | 286 | 0.2% |
| 1,177.20 | 276 | 0.2% |
| 512.30 | 268 | 0.1% |
| Other values (34936) | 160163 |
Most occurring characters
| Value | Count | Frequency (%) |
| 401718 | ||
| $ | 182242 | |
| . | 163625 | |
| , | 150824 | 7.8% |
| 1 | 127167 | 6.6% |
| 6 | 103329 | 5.3% |
| 4 | 102418 | 5.3% |
| 5 | 101394 | 5.2% |
| 7 | 99792 | 5.2% |
| 3 | 99129 | 5.1% |
| Other values (5) | 402657 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1934295 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 401718 | ||
| $ | 182242 | |
| . | 163625 | |
| , | 150824 | 7.8% |
| 1 | 127167 | 6.6% |
| 6 | 103329 | 5.3% |
| 4 | 102418 | 5.3% |
| 5 | 101394 | 5.2% |
| 7 | 99792 | 5.2% |
| 3 | 99129 | 5.1% |
| Other values (5) | 402657 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1934295 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 401718 | ||
| $ | 182242 | |
| . | 163625 | |
| , | 150824 | 7.8% |
| 1 | 127167 | 6.6% |
| 6 | 103329 | 5.3% |
| 4 | 102418 | 5.3% |
| 5 | 101394 | 5.2% |
| 7 | 99792 | 5.2% |
| 3 | 99129 | 5.1% |
| Other values (5) | 402657 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1934295 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 401718 | ||
| $ | 182242 | |
| . | 163625 | |
| , | 150824 | 7.8% |
| 1 | 127167 | 6.6% |
| 6 | 103329 | 5.3% |
| 4 | 102418 | 5.3% |
| 5 | 101394 | 5.2% |
| 7 | 99792 | 5.2% |
| 3 | 99129 | 5.1% |
| Other values (5) | 402657 |
YR_BUILT
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED 
| Distinct | 236 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 22786 |
| Missing (%) | 12.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1933.2157 |
| Minimum | 1700 |
|---|---|
| Maximum | 20198 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1700 |
|---|---|
| 5-th percentile | 1880 |
| Q1 | 1900 |
| median | 1920 |
| Q3 | 1965 |
| 95-th percentile | 2017 |
| Maximum | 20198 |
| Range | 18498 |
| Interquartile range (IQR) | 65 |
Descriptive statistics
| Standard deviation | 63.981908 |
|---|---|
| Coefficient of variation (CV) | 0.033096104 |
| Kurtosis | 41646.839 |
| Mean | 1933.2157 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 146.10676 |
| Sum | 3.0826284 × 108 |
| Variance | 4093.6845 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1900 | 17988 | 9.9% |
| 1920 | 12195 | 6.7% |
| 1910 | 11790 | 6.5% |
| 1905 | 10567 | 5.8% |
| 1899 | 9866 | 5.4% |
| 1890 | 8985 | 4.9% |
| 1930 | 4176 | 2.3% |
| 1999 | 3820 | 2.1% |
| 1925 | 3785 | 2.1% |
| 1880 | 3320 | 1.8% |
| Other values (226) | 72964 | |
| (Missing) | 22786 | 12.5% |
| Value | Count | Frequency (%) |
| 1700 | 1 | |
| 1710 | 1 | |
| 1725 | 2 | |
| 1752 | 2 | |
| 1760 | 1 | |
| 1775 | 1 | |
| 1779 | 1 | |
| 1780 | 1 | |
| 1785 | 2 | |
| 1789 | 1 |
| Value | Count | Frequency (%) |
| 20198 | 1 | < 0.1% |
| 2023 | 8 | < 0.1% |
| 2022 | 340 | 0.2% |
| 2021 | 1224 | |
| 2020 | 1613 | |
| 2019 | 1158 | |
| 2018 | 2437 | |
| 2017 | 2185 | |
| 2016 | 1603 | |
| 2015 | 1392 |
YR_REMODEL
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 106 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 95524 |
| Missing (%) | 52.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.0116 |
| Minimum | 0 |
|---|---|
| Maximum | 20220 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1979 |
| Q1 | 1987 |
| median | 2005 |
| Q3 | 2016 |
| 95-th percentile | 2021 |
| Maximum | 20220 |
| Range | 20220 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 65.386476 |
|---|---|
| Coefficient of variation (CV) | 0.032660387 |
| Kurtosis | 69535.907 |
| Mean | 2002.0116 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 248.07278 |
| Sum | 1.7361045 × 108 |
| Variance | 4275.3912 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1985 | 3962 | 2.2% |
| 2017 | 3910 | 2.1% |
| 2021 | 3361 | 1.8% |
| 2005 | 3207 | 1.8% |
| 2019 | 3124 | 1.7% |
| 1980 | 3112 | 1.7% |
| 2018 | 3029 | 1.7% |
| 2016 | 2893 | 1.6% |
| 2022 | 2718 | 1.5% |
| 2004 | 2711 | 1.5% |
| Other values (96) | 54691 | |
| (Missing) | 95524 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 201 | 2 | < 0.1% |
| 221 | 1 | < 0.1% |
| 1900 | 9 | |
| 1902 | 1 | < 0.1% |
| 1904 | 1 | < 0.1% |
| 1910 | 1 | < 0.1% |
| 1914 | 3 | < 0.1% |
| 1915 | 1 | < 0.1% |
| 1916 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 20220 | 1 | < 0.1% |
| 2921 | 1 | < 0.1% |
| 2121 | 1 | < 0.1% |
| 2023 | 118 | 0.1% |
| 2022 | 2718 | |
| 2021 | 3361 | |
| 2020 | 2624 | |
| 2019 | 3124 | |
| 2018 | 3029 | |
| 2017 | 3910 |
STRUCTURE_CLASS
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 164836 |
| Missing (%) | 90.4% |
| Memory size | 1.4 MiB |
| C - Brick/Concr | |
|---|---|
| D - Wood/Frame | |
| B - Reinf Concr | |
| A - Struct Steel | 925 |
| E - Metal | 103 |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 14.757497 |
| Min length | 9 |
Characters and Unicode
| Total characters | 256869 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | D - Wood/Frame |
|---|---|
| 2nd row | D - Wood/Frame |
| 3rd row | D - Wood/Frame |
| 4th row | D - Wood/Frame |
| 5th row | D - Wood/Frame |
Common Values
| Value | Count | Frequency (%) |
| C - Brick/Concr | 10201 | 5.6% |
| D - Wood/Frame | 4528 | 2.5% |
| B - Reinf Concr | 1649 | 0.9% |
| A - Struct Steel | 925 | 0.5% |
| E - Metal | 103 | 0.1% |
| (Missing) | 164836 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 17406 | ||
| c | 10201 | |
| brick/concr | 10201 | |
| d | 4528 | 8.3% |
| wood/frame | 4528 | 8.3% |
| b | 1649 | 3.0% |
| reinf | 1649 | 3.0% |
| concr | 1649 | 3.0% |
| a | 925 | 1.7% |
| struct | 925 | 1.7% |
| Other values (3) | 1131 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 37386 | ||
| r | 27504 | |
| c | 22976 | |
| C | 22051 | 8.6% |
| o | 20906 | 8.1% |
| - | 17406 | 6.8% |
| / | 14729 | 5.7% |
| n | 13499 | 5.3% |
| B | 11850 | 4.6% |
| i | 11850 | 4.6% |
| Other values (17) | 56712 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 256869 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 37386 | ||
| r | 27504 | |
| c | 22976 | |
| C | 22051 | 8.6% |
| o | 20906 | 8.1% |
| - | 17406 | 6.8% |
| / | 14729 | 5.7% |
| n | 13499 | 5.3% |
| B | 11850 | 4.6% |
| i | 11850 | 4.6% |
| Other values (17) | 56712 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 256869 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 37386 | ||
| r | 27504 | |
| c | 22976 | |
| C | 22051 | 8.6% |
| o | 20906 | 8.1% |
| - | 17406 | 6.8% |
| / | 14729 | 5.7% |
| n | 13499 | 5.3% |
| B | 11850 | 4.6% |
| i | 11850 | 4.6% |
| Other values (17) | 56712 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 256869 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 37386 | ||
| r | 27504 | |
| c | 22976 | |
| C | 22051 | 8.6% |
| o | 20906 | 8.1% |
| - | 17406 | 6.8% |
| / | 14729 | 5.7% |
| n | 13499 | 5.3% |
| B | 11850 | 4.6% |
| i | 11850 | 4.6% |
| Other values (17) | 56712 |
ROOF_STRUCTURE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 36225 |
| Missing (%) | 19.9% |
| Memory size | 1.4 MiB |
| F - Flat | |
|---|---|
| G - Gable | |
| H - Hip | |
| M - Mansard | |
| L - Gambrel | 2153 |
| Other values (2) | 695 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.5486279 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1248245 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F - Flat |
|---|---|
| 2nd row | F - Flat |
| 3rd row | F - Flat |
| 4th row | M - Mansard |
| 5th row | M - Mansard |
Common Values
| Value | Count | Frequency (%) |
| F - Flat | 68966 | |
| G - Gable | 46606 | |
| H - Hip | 14023 | 7.7% |
| M - Mansard | 13574 | 7.4% |
| L - Gambrel | 2153 | 1.2% |
| S - Shed | 350 | 0.2% |
| O - Other | 345 | 0.2% |
| (Missing) | 36225 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 146017 | ||
| f | 68966 | |
| flat | 68966 | |
| g | 46606 | 10.6% |
| gable | 46606 | 10.6% |
| h | 14023 | 3.2% |
| hip | 14023 | 3.2% |
| m | 13574 | 3.1% |
| mansard | 13574 | 3.1% |
| l | 2153 | 0.5% |
| Other values (5) | 3543 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 292034 | ||
| - | 146017 | |
| a | 144873 | |
| F | 137932 | |
| l | 117725 | |
| G | 95365 | 7.6% |
| t | 69311 | 5.6% |
| e | 49454 | 4.0% |
| b | 48759 | 3.9% |
| H | 28046 | 2.2% |
| Other values (12) | 118729 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1248245 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 292034 | ||
| - | 146017 | |
| a | 144873 | |
| F | 137932 | |
| l | 117725 | |
| G | 95365 | 7.6% |
| t | 69311 | 5.6% |
| e | 49454 | 4.0% |
| b | 48759 | 3.9% |
| H | 28046 | 2.2% |
| Other values (12) | 118729 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1248245 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 292034 | ||
| - | 146017 | |
| a | 144873 | |
| F | 137932 | |
| l | 117725 | |
| G | 95365 | 7.6% |
| t | 69311 | 5.6% |
| e | 49454 | 4.0% |
| b | 48759 | 3.9% |
| H | 28046 | 2.2% |
| Other values (12) | 118729 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1248245 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 292034 | ||
| - | 146017 | |
| a | 144873 | |
| F | 137932 | |
| l | 117725 | |
| G | 95365 | 7.6% |
| t | 69311 | 5.6% |
| e | 49454 | 4.0% |
| b | 48759 | 3.9% |
| H | 28046 | 2.2% |
| Other values (12) | 118729 |
ROOF_COVER
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 36219 |
| Missing (%) | 19.9% |
| Memory size | 1.4 MiB |
| A - Asphalt Shingl | |
|---|---|
| R - Rubber Roof | |
| C - Composition | |
| S - Slate | |
| O - Other | 1032 |
| Other values (2) | 383 |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 15.773043 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2303227 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C - Composition |
|---|---|
| 2nd row | C - Composition |
| 3rd row | C - Composition |
| 4th row | C - Composition |
| 5th row | C - Composition |
Common Values
| Value | Count | Frequency (%) |
| A - Asphalt Shingl | 63863 | |
| R - Rubber Roof | 46495 | |
| C - Composition | 22459 | 12.3% |
| S - Slate | 11791 | 6.5% |
| O - Other | 1032 | 0.6% |
| T - Tile | 269 | 0.1% |
| W - Wood Shingle | 114 | 0.1% |
| (Missing) | 36219 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 146023 | ||
| a | 63863 | |
| asphalt | 63863 | |
| shingl | 63863 | |
| r | 46495 | 8.5% |
| rubber | 46495 | 8.5% |
| roof | 46495 | 8.5% |
| c | 22459 | 4.1% |
| composition | 22459 | 4.1% |
| slate | 11791 | 2.1% |
| Other values (8) | 14735 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 402518 | ||
| o | 160595 | 7.0% |
| - | 146023 | 6.3% |
| l | 139900 | 6.1% |
| R | 139485 | 6.1% |
| h | 128872 | 5.6% |
| A | 127726 | 5.5% |
| i | 109164 | 4.7% |
| t | 99145 | 4.3% |
| b | 92990 | 4.0% |
| Other values (16) | 756809 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2303227 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 402518 | ||
| o | 160595 | 7.0% |
| - | 146023 | 6.3% |
| l | 139900 | 6.1% |
| R | 139485 | 6.1% |
| h | 128872 | 5.6% |
| A | 127726 | 5.5% |
| i | 109164 | 4.7% |
| t | 99145 | 4.3% |
| b | 92990 | 4.0% |
| Other values (16) | 756809 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2303227 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 402518 | ||
| o | 160595 | 7.0% |
| - | 146023 | 6.3% |
| l | 139900 | 6.1% |
| R | 139485 | 6.1% |
| h | 128872 | 5.6% |
| A | 127726 | 5.5% |
| i | 109164 | 4.7% |
| t | 99145 | 4.3% |
| b | 92990 | 4.0% |
| Other values (16) | 756809 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2303227 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 402518 | ||
| o | 160595 | 7.0% |
| - | 146023 | 6.3% |
| l | 139900 | 6.1% |
| R | 139485 | 6.1% |
| h | 128872 | 5.6% |
| A | 127726 | 5.5% |
| i | 109164 | 4.7% |
| t | 99145 | 4.3% |
| b | 92990 | 4.0% |
| Other values (16) | 756809 |
INT_WALL
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48749 |
| Missing (%) | 26.7% |
| Memory size | 1.4 MiB |
| N - Normal | |
|---|---|
| E - Elaborate | 4788 |
| S - Substandard | 80 |
| G - Good | 3 |
Length
| Max length | 15 |
|---|---|
| Median length | 10 |
| Mean length | 10.110553 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1349688 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N - Normal |
|---|---|
| 2nd row | N - Normal |
| 3rd row | N - Normal |
| 4th row | N - Normal |
| 5th row | N - Normal |
Common Values
| Value | Count | Frequency (%) |
| N - Normal | 128622 | |
| E - Elaborate | 4788 | 2.6% |
| S - Substandard | 80 | < 0.1% |
| G - Good | 3 | < 0.1% |
| (Missing) | 48749 | 26.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 133493 | ||
| n | 128622 | |
| normal | 128622 | |
| e | 4788 | 1.2% |
| elaborate | 4788 | 1.2% |
| s | 80 | < 0.1% |
| substandard | 80 | < 0.1% |
| g | 3 | < 0.1% |
| good | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 266986 | ||
| N | 257244 | |
| a | 138358 | |
| - | 133493 | |
| r | 133490 | |
| o | 133416 | |
| l | 133410 | |
| m | 128622 | |
| E | 9576 | 0.7% |
| t | 4868 | 0.4% |
| Other values (8) | 10225 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1349688 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 266986 | ||
| N | 257244 | |
| a | 138358 | |
| - | 133493 | |
| r | 133490 | |
| o | 133416 | |
| l | 133410 | |
| m | 128622 | |
| E | 9576 | 0.7% |
| t | 4868 | 0.4% |
| Other values (8) | 10225 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1349688 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 266986 | ||
| N | 257244 | |
| a | 138358 | |
| - | 133493 | |
| r | 133490 | |
| o | 133416 | |
| l | 133410 | |
| m | 128622 | |
| E | 9576 | 0.7% |
| t | 4868 | 0.4% |
| Other values (8) | 10225 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1349688 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 266986 | ||
| N | 257244 | |
| a | 138358 | |
| - | 133493 | |
| r | 133490 | |
| o | 133416 | |
| l | 133410 | |
| m | 128622 | |
| E | 9576 | 0.7% |
| t | 4868 | 0.4% |
| Other values (8) | 10225 | 0.8% |
EXT_FNISHED
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 22884 |
| Missing (%) | 12.6% |
| Memory size | 1.4 MiB |
| B - Brick/Stone | |
|---|---|
| M - Vinyl | |
| W - Wood Shake | |
| F - Frame/Clapbrd | |
| C - Cement Board | |
| Other values (23) |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 13.014464 |
| Min length | 9 |
Characters and Unicode
| Total characters | 2073959 |
|---|---|
| Distinct characters | 49 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | A - Asbestos |
|---|---|
| 2nd row | M - Vinyl |
| 3rd row | M - Vinyl |
| 4th row | M - Vinyl |
| 5th row | M - Vinyl |
Common Values
| Value | Count | Frequency (%) |
| B - Brick/Stone | 52403 | |
| M - Vinyl | 42605 | |
| W - Wood Shake | 16453 | 9.0% |
| F - Frame/Clapbrd | 13337 | 7.3% |
| C - Cement Board | 8992 | 4.9% |
| 01 - Brick | 7693 | 4.2% |
| A - Asbestos | 3967 | 2.2% |
| G - Glass | 3586 | 2.0% |
| 09 - Wood Siding | 1762 | 1.0% |
| S - Stucco | 1309 | 0.7% |
| Other values (18) | 7251 | 4.0% |
| (Missing) | 22884 |
Length
| Value | Count | Frequency (%) |
| 159427 | ||
| brick/stone | 52403 | 10.3% |
| b | 52403 | 10.3% |
| m | 42605 | 8.4% |
| vinyl | 42605 | 8.4% |
| wood | 18215 | 3.6% |
| w | 16453 | 3.2% |
| shake | 16453 | 3.2% |
| f | 13337 | 2.6% |
| frame/clapbrd | 13337 | 2.6% |
| Other values (52) | 82230 |
Most occurring characters
| Value | Count | Frequency (%) |
| 350110 | ||
| - | 159358 | 7.7% |
| B | 123354 | 5.9% |
| n | 112511 | 5.4% |
| i | 109081 | 5.3% |
| e | 108101 | 5.2% |
| o | 107712 | 5.2% |
| r | 101306 | 4.9% |
| k | 78411 | 3.8% |
| S | 75848 | 3.7% |
| Other values (39) | 748167 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2073959 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 350110 | ||
| - | 159358 | 7.7% |
| B | 123354 | 5.9% |
| n | 112511 | 5.4% |
| i | 109081 | 5.3% |
| e | 108101 | 5.2% |
| o | 107712 | 5.2% |
| r | 101306 | 4.9% |
| k | 78411 | 3.8% |
| S | 75848 | 3.7% |
| Other values (39) | 748167 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2073959 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 350110 | ||
| - | 159358 | 7.7% |
| B | 123354 | 5.9% |
| n | 112511 | 5.4% |
| i | 109081 | 5.3% |
| e | 108101 | 5.2% |
| o | 107712 | 5.2% |
| r | 101306 | 4.9% |
| k | 78411 | 3.8% |
| S | 75848 | 3.7% |
| Other values (39) | 748167 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2073959 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 350110 | ||
| - | 159358 | 7.7% |
| B | 123354 | 5.9% |
| n | 112511 | 5.4% |
| i | 109081 | 5.3% |
| e | 108101 | 5.2% |
| o | 107712 | 5.2% |
| r | 101306 | 4.9% |
| k | 78411 | 3.8% |
| S | 75848 | 3.7% |
| Other values (39) | 748167 |
INT_COND
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48746 |
| Missing (%) | 26.7% |
| Memory size | 1.4 MiB |
| A - Average | |
|---|---|
| G - Good | |
| E - Excellent | |
| F - Fair | 1198 |
| P - Poor | 88 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 9.9508525 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1328399 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A - Average |
|---|---|
| 2nd row | A - Average |
| 3rd row | A - Average |
| 4th row | A - Average |
| 5th row | A - Average |
Common Values
| Value | Count | Frequency (%) |
| A - Average | 63032 | |
| G - Good | 54911 | |
| E - Excellent | 14267 | 7.8% |
| F - Fair | 1198 | 0.7% |
| P - Poor | 88 | < 0.1% |
| (Missing) | 48746 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 133496 | ||
| a | 63032 | |
| average | 63032 | |
| g | 54911 | |
| good | 54911 | |
| e | 14267 | 3.6% |
| excellent | 14267 | 3.6% |
| f | 1198 | 0.3% |
| fair | 1198 | 0.3% |
| p | 88 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 266992 | ||
| e | 154598 | |
| - | 133496 | |
| A | 126064 | |
| o | 109998 | |
| G | 109822 | |
| r | 64318 | 4.8% |
| a | 64230 | 4.8% |
| v | 63032 | 4.7% |
| g | 63032 | 4.7% |
| Other values (10) | 172817 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1328399 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 266992 | ||
| e | 154598 | |
| - | 133496 | |
| A | 126064 | |
| o | 109998 | |
| G | 109822 | |
| r | 64318 | 4.8% |
| a | 64230 | 4.8% |
| v | 63032 | 4.7% |
| g | 63032 | 4.7% |
| Other values (10) | 172817 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1328399 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 266992 | ||
| e | 154598 | |
| - | 133496 | |
| A | 126064 | |
| o | 109998 | |
| G | 109822 | |
| r | 64318 | 4.8% |
| a | 64230 | 4.8% |
| v | 63032 | 4.7% |
| g | 63032 | 4.7% |
| Other values (10) | 172817 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1328399 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 266992 | ||
| e | 154598 | |
| - | 133496 | |
| A | 126064 | |
| o | 109998 | |
| G | 109822 | |
| r | 64318 | 4.8% |
| a | 64230 | 4.8% |
| v | 63032 | 4.7% |
| g | 63032 | 4.7% |
| Other values (10) | 172817 |
EXT_COND
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 36158 |
| Missing (%) | 19.8% |
| Memory size | 1.4 MiB |
| A - Average | |
|---|---|
| G - Good | |
| E - Excellent | |
| F - Fair | 2215 |
| P - Poor | 61 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 9.9419033 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1452353 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F - Fair |
|---|---|
| 2nd row | A - Average |
| 3rd row | G - Good |
| 4th row | A - Average |
| 5th row | F - Fair |
Common Values
| Value | Count | Frequency (%) |
| A - Average | 78882 | |
| G - Good | 55519 | |
| E - Excellent | 9407 | 5.2% |
| F - Fair | 2215 | 1.2% |
| P - Poor | 61 | < 0.1% |
| (Missing) | 36158 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 146084 | ||
| a | 78882 | |
| average | 78882 | |
| g | 55519 | 12.7% |
| good | 55519 | 12.7% |
| e | 9407 | 2.1% |
| excellent | 9407 | 2.1% |
| f | 2215 | 0.5% |
| fair | 2215 | 0.5% |
| p | 61 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 292168 | ||
| e | 176578 | |
| A | 157764 | |
| - | 146084 | |
| o | 111160 | 7.7% |
| G | 111038 | 7.6% |
| r | 81158 | 5.6% |
| a | 81097 | 5.6% |
| v | 78882 | 5.4% |
| g | 78882 | 5.4% |
| Other values (10) | 137542 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1452353 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 292168 | ||
| e | 176578 | |
| A | 157764 | |
| - | 146084 | |
| o | 111160 | 7.7% |
| G | 111038 | 7.6% |
| r | 81158 | 5.6% |
| a | 81097 | 5.6% |
| v | 78882 | 5.4% |
| g | 78882 | 5.4% |
| Other values (10) | 137542 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1452353 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 292168 | ||
| e | 176578 | |
| A | 157764 | |
| - | 146084 | |
| o | 111160 | 7.7% |
| G | 111038 | 7.6% |
| r | 81158 | 5.6% |
| a | 81097 | 5.6% |
| v | 78882 | 5.4% |
| g | 78882 | 5.4% |
| Other values (10) | 137542 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1452353 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 292168 | ||
| e | 176578 | |
| A | 157764 | |
| - | 146084 | |
| o | 111160 | 7.7% |
| G | 111038 | 7.6% |
| r | 81158 | 5.6% |
| a | 81097 | 5.6% |
| v | 78882 | 5.4% |
| g | 78882 | 5.4% |
| Other values (10) | 137542 |
OVERALL_COND
Categorical
IMBALANCE  MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9587 |
| Missing (%) | 5.3% |
| Memory size | 1.4 MiB |
| A - Average | |
|---|---|
| G - Good | |
| E - Excellent | 1728 |
| VG - Very Good | 1380 |
| EX - Excellent | 1252 |
| Other values (5) | 1148 |
Length
| Max length | 23 |
|---|---|
| Median length | 11 |
| Mean length | 10.412684 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1797802 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | A - Average |
|---|---|
| 2nd row | A - Average |
| 3rd row | A - Average |
| 4th row | A - Average |
| 5th row | A - Average |
Common Values
| Value | Count | Frequency (%) |
| A - Average | 130657 | |
| G - Good | 36490 | 20.0% |
| E - Excellent | 1728 | 0.9% |
| VG - Very Good | 1380 | 0.8% |
| EX - Excellent | 1252 | 0.7% |
| F - Fair | 1019 | 0.6% |
| P - Poor | 98 | 0.1% |
| US - Unsound | 18 | < 0.1% |
| VP - Very Poor | 12 | < 0.1% |
| AVG - Default - Average | 1 | < 0.1% |
| (Missing) | 9587 | 5.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 172656 | ||
| average | 130658 | |
| a | 130657 | |
| good | 37870 | 7.3% |
| g | 36490 | 7.0% |
| excellent | 2980 | 0.6% |
| e | 1728 | 0.3% |
| very | 1392 | 0.3% |
| vg | 1380 | 0.3% |
| ex | 1252 | 0.2% |
| Other values (9) | 2296 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 346704 | ||
| e | 268669 | |
| A | 261316 | |
| - | 172656 | |
| r | 133179 | 7.4% |
| a | 131678 | 7.3% |
| v | 130658 | 7.3% |
| g | 130658 | 7.3% |
| o | 75978 | 4.2% |
| G | 75741 | 4.2% |
| Other values (19) | 70565 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1797802 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 346704 | ||
| e | 268669 | |
| A | 261316 | |
| - | 172656 | |
| r | 133179 | 7.4% |
| a | 131678 | 7.3% |
| v | 130658 | 7.3% |
| g | 130658 | 7.3% |
| o | 75978 | 4.2% |
| G | 75741 | 4.2% |
| Other values (19) | 70565 | 3.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1797802 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 346704 | ||
| e | 268669 | |
| A | 261316 | |
| - | 172656 | |
| r | 133179 | 7.4% |
| a | 131678 | 7.3% |
| v | 130658 | 7.3% |
| g | 130658 | 7.3% |
| o | 75978 | 4.2% |
| G | 75741 | 4.2% |
| Other values (19) | 70565 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1797802 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 346704 | ||
| e | 268669 | |
| A | 261316 | |
| - | 172656 | |
| r | 133179 | 7.4% |
| a | 131678 | 7.3% |
| v | 130658 | 7.3% |
| g | 130658 | 7.3% |
| o | 75978 | 4.2% |
| G | 75741 | 4.2% |
| Other values (19) | 70565 | 3.9% |
BED_RMS
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48765 |
| Missing (%) | 26.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1484376 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 3184 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.1022194 |
|---|---|
| Coefficient of variation (CV) | 0.66770242 |
| Kurtosis | 2.2563752 |
| Mean | 3.1484376 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3651924 |
| Sum | 420244 |
| Variance | 4.4193264 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 38002 | |
| 3 | 27552 | |
| 1 | 22279 | |
| 4 | 15328 | 8.4% |
| 6 | 9957 | 5.5% |
| 5 | 8124 | 4.5% |
| 9 | 3275 | 1.8% |
| 0 | 3184 | 1.7% |
| 8 | 2297 | 1.3% |
| 7 | 2208 | 1.2% |
| Other values (9) | 1271 | 0.7% |
| (Missing) | 48765 |
| Value | Count | Frequency (%) |
| 0 | 3184 | 1.7% |
| 1 | 22279 | |
| 2 | 38002 | |
| 3 | 27552 | |
| 4 | 15328 | |
| 5 | 8124 | 4.5% |
| 6 | 9957 | 5.5% |
| 7 | 2208 | 1.2% |
| 8 | 2297 | 1.3% |
| 9 | 3275 | 1.8% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 17 | 5 | < 0.1% |
| 16 | 2 | < 0.1% |
| 15 | 27 | < 0.1% |
| 14 | 54 | < 0.1% |
| 13 | 32 | < 0.1% |
| 12 | 351 | 0.2% |
| 11 | 407 | 0.2% |
| 10 | 392 | 0.2% |
| 9 | 3275 |
FULL_BTH
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11644 |
| Missing (%) | 6.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.359758 |
| Minimum | 0 |
|---|---|
| Maximum | 21 |
| Zeros | 36940 |
| Zeros (%) | 20.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 21 |
| Range | 21 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.0605678 |
|---|---|
| Coefficient of variation (CV) | 0.77996806 |
| Kurtosis | 3.1724521 |
| Mean | 1.359758 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.921087 |
| Sum | 231972 |
| Variance | 1.1248041 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 64877 | |
| 2 | 45213 | |
| 0 | 36940 | |
| 3 | 19794 | 10.9% |
| 4 | 2551 | 1.4% |
| 6 | 580 | 0.3% |
| 5 | 519 | 0.3% |
| 7 | 65 | < 0.1% |
| 8 | 35 | < 0.1% |
| 9 | 12 | < 0.1% |
| Other values (7) | 12 | < 0.1% |
| (Missing) | 11644 | 6.4% |
| Value | Count | Frequency (%) |
| 0 | 36940 | |
| 1 | 64877 | |
| 2 | 45213 | |
| 3 | 19794 | 10.9% |
| 4 | 2551 | 1.4% |
| 5 | 519 | 0.3% |
| 6 | 580 | 0.3% |
| 7 | 65 | < 0.1% |
| 8 | 35 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 21 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 3 | < 0.1% |
| 12 | 2 | < 0.1% |
| 10 | 2 | < 0.1% |
| 9 | 12 | < 0.1% |
| 8 | 35 | |
| 7 | 65 |
HLF_BTH
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11509 |
| Missing (%) | 6.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22192546 |
| Minimum | 0 |
|---|---|
| Maximum | 7 |
| Zeros | 135661 |
| Zeros (%) | 74.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.46002771 |
|---|---|
| Coefficient of variation (CV) | 2.0728929 |
| Kurtosis | 5.6723827 |
| Mean | 0.22192546 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.1362235 |
| Sum | 37890 |
| Variance | 0.2116255 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 135661 | |
| 1 | 32695 | 17.9% |
| 2 | 1983 | 1.1% |
| 3 | 361 | 0.2% |
| 4 | 23 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| (Missing) | 11509 | 6.3% |
| Value | Count | Frequency (%) |
| 0 | 135661 | |
| 1 | 32695 | 17.9% |
| 2 | 1983 | 1.1% |
| 3 | 361 | 0.2% |
| 4 | 23 | < 0.1% |
| 5 | 7 | < 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 7 | < 0.1% |
| 4 | 23 | < 0.1% |
| 3 | 361 | 0.2% |
| 2 | 1983 | 1.1% |
| 1 | 32695 | 17.9% |
| 0 | 135661 |
KITCHENS
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 11718 |
| Missing (%) | 6.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0518813 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 36863 |
| Zeros (%) | 20.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8053745 |
|---|---|
| Coefficient of variation (CV) | 0.76565153 |
| Kurtosis | 0.7498689 |
| Mean | 1.0518813 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8745803 |
| Sum | 179371 |
| Variance | 0.64862808 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 102061 | |
| 0 | 36863 | 20.2% |
| 2 | 17625 | 9.7% |
| 3 | 13841 | 7.6% |
| 4 | 133 | 0.1% |
| 5 | 1 | < 0.1% |
| (Missing) | 11718 | 6.4% |
| Value | Count | Frequency (%) |
| 0 | 36863 | 20.2% |
| 1 | 102061 | |
| 2 | 17625 | 9.7% |
| 3 | 13841 | 7.6% |
| 4 | 133 | 0.1% |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 1 | < 0.1% |
| 4 | 133 | 0.1% |
| 3 | 13841 | 7.6% |
| 2 | 17625 | 9.7% |
| 1 | 102061 | |
| 0 | 36863 | 20.2% |
TT_RMS
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48829 |
| Missing (%) | 26.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.940583 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 15 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.0097978 |
|---|---|
| Coefficient of variation (CV) | 0.57773214 |
| Kurtosis | 0.54651164 |
| Mean | 6.940583 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 1.1306302 |
| Sum | 925964 |
| Variance | 16.078479 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 24165 | |
| 5 | 18593 | 10.2% |
| 3 | 15940 | 8.7% |
| 6 | 15592 | 8.6% |
| 7 | 10547 | 5.8% |
| 8 | 7496 | 4.1% |
| 10 | 5715 | 3.1% |
| 12 | 5159 | 2.8% |
| 9 | 4870 | 2.7% |
| 2 | 4729 | 2.6% |
| Other values (10) | 20607 | |
| (Missing) | 48829 |
| Value | Count | Frequency (%) |
| 1 | 697 | 0.4% |
| 2 | 4729 | 2.6% |
| 3 | 15940 | |
| 4 | 24165 | |
| 5 | 18593 | |
| 6 | 15592 | |
| 7 | 10547 | |
| 8 | 7496 | 4.1% |
| 9 | 4870 | 2.7% |
| 10 | 5715 | 3.1% |
| Value | Count | Frequency (%) |
| 20 | 598 | 0.3% |
| 19 | 183 | 0.1% |
| 18 | 2387 | |
| 17 | 1286 | 0.7% |
| 16 | 993 | 0.5% |
| 15 | 4658 | |
| 14 | 3195 | |
| 13 | 2332 | |
| 12 | 5159 | |
| 11 | 4278 |
BDRM_COND
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110500 |
| Missing (%) | 60.6% |
| Memory size | 1.4 MiB |
| A - Average | |
|---|---|
| G - Good | |
| E - Excellent | 962 |
| F - Fair | 837 |
| P - Poor | 62 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10.437498 |
| Min length | 8 |
Characters and Unicode
| Total characters | 748807 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A - Average |
|---|---|
| 2nd row | A - Average |
| 3rd row | A - Average |
| 4th row | A - Average |
| 5th row | A - Average |
Common Values
| Value | Count | Frequency (%) |
| A - Average | 56687 | |
| G - Good | 13194 | 7.2% |
| E - Excellent | 962 | 0.5% |
| F - Fair | 837 | 0.5% |
| P - Poor | 62 | < 0.1% |
| (Missing) | 110500 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 71742 | ||
| a | 56687 | |
| average | 56687 | |
| g | 13194 | 6.1% |
| good | 13194 | 6.1% |
| e | 962 | 0.4% |
| excellent | 962 | 0.4% |
| f | 837 | 0.4% |
| fair | 837 | 0.4% |
| p | 62 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 143484 | ||
| e | 115298 | |
| A | 113374 | |
| - | 71742 | |
| r | 57586 | |
| a | 57524 | |
| v | 56687 | 7.6% |
| g | 56687 | 7.6% |
| o | 26512 | 3.5% |
| G | 26388 | 3.5% |
| Other values (10) | 23525 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 748807 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 143484 | ||
| e | 115298 | |
| A | 113374 | |
| - | 71742 | |
| r | 57586 | |
| a | 57524 | |
| v | 56687 | 7.6% |
| g | 56687 | 7.6% |
| o | 26512 | 3.5% |
| G | 26388 | 3.5% |
| Other values (10) | 23525 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 748807 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 143484 | ||
| e | 115298 | |
| A | 113374 | |
| - | 71742 | |
| r | 57586 | |
| a | 57524 | |
| v | 56687 | 7.6% |
| g | 56687 | 7.6% |
| o | 26512 | 3.5% |
| G | 26388 | 3.5% |
| Other values (10) | 23525 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 748807 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 143484 | ||
| e | 115298 | |
| A | 113374 | |
| - | 71742 | |
| r | 57586 | |
| a | 57524 | |
| v | 56687 | 7.6% |
| g | 56687 | 7.6% |
| o | 26512 | 3.5% |
| G | 26388 | 3.5% |
| Other values (10) | 23525 | 3.1% |
BTHRM_STYLE1
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49548 |
| Missing (%) | 27.2% |
| Memory size | 1.4 MiB |
| M - Modern | |
|---|---|
| S - Semi-Modern | |
| L - Luxury | |
| N - No Remodeling | 5662 |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 12.505042 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1659344 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S - Semi-Modern |
|---|---|
| 2nd row | M - Modern |
| 3rd row | M - Modern |
| 4th row | S - Semi-Modern |
| 5th row | N - No Remodeling |
Common Values
| Value | Count | Frequency (%) |
| M - Modern | 61020 | |
| S - Semi-Modern | 58554 | |
| L - Luxury | 7458 | 4.1% |
| N - No Remodeling | 5662 | 3.1% |
| (Missing) | 49548 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 132694 | ||
| m | 61020 | |
| modern | 61020 | |
| s | 58554 | |
| semi-modern | 58554 | |
| l | 7458 | 1.8% |
| luxury | 7458 | 1.8% |
| n | 5662 | 1.4% |
| no | 5662 | 1.4% |
| remodeling | 5662 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 271050 | ||
| - | 191248 | |
| e | 189452 | |
| M | 180594 | |
| o | 130898 | |
| r | 127032 | |
| d | 125236 | |
| n | 125236 | |
| S | 117108 | |
| i | 64216 | 3.9% |
| Other values (9) | 137274 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1659344 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 271050 | ||
| - | 191248 | |
| e | 189452 | |
| M | 180594 | |
| o | 130898 | |
| r | 127032 | |
| d | 125236 | |
| n | 125236 | |
| S | 117108 | |
| i | 64216 | 3.9% |
| Other values (9) | 137274 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1659344 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 271050 | ||
| - | 191248 | |
| e | 189452 | |
| M | 180594 | |
| o | 130898 | |
| r | 127032 | |
| d | 125236 | |
| n | 125236 | |
| S | 117108 | |
| i | 64216 | 3.9% |
| Other values (9) | 137274 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1659344 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 271050 | ||
| - | 191248 | |
| e | 189452 | |
| M | 180594 | |
| o | 130898 | |
| r | 127032 | |
| d | 125236 | |
| n | 125236 | |
| S | 117108 | |
| i | 64216 | 3.9% |
| Other values (9) | 137274 |
BTHRM_STYLE2
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 97077 |
| Missing (%) | 53.3% |
| Memory size | 1.4 MiB |
| M - Modern | |
|---|---|
| S - Semi-Modern | |
| N - No Remodeling | 3920 |
| L - Luxury | 3845 |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 12.449598 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1060270 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S - Semi-Modern |
|---|---|
| 2nd row | M - Modern |
| 3rd row | M - Modern |
| 4th row | S - Semi-Modern |
| 5th row | N - No Remodeling |
Common Values
| Value | Count | Frequency (%) |
| M - Modern | 41164 | |
| S - Semi-Modern | 36236 | 19.9% |
| N - No Remodeling | 3920 | 2.2% |
| L - Luxury | 3845 | 2.1% |
| (Missing) | 97077 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 85165 | ||
| m | 41164 | |
| modern | 41164 | |
| s | 36236 | |
| semi-modern | 36236 | |
| n | 3920 | 1.5% |
| no | 3920 | 1.5% |
| remodeling | 3920 | 1.5% |
| l | 3845 | 1.5% |
| luxury | 3845 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 174250 | ||
| e | 121476 | |
| - | 121401 | |
| M | 118564 | |
| o | 85240 | |
| d | 81320 | |
| n | 81320 | |
| r | 81245 | |
| S | 72472 | |
| i | 40156 | 3.8% |
| Other values (9) | 82826 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1060270 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 174250 | ||
| e | 121476 | |
| - | 121401 | |
| M | 118564 | |
| o | 85240 | |
| d | 81320 | |
| n | 81320 | |
| r | 81245 | |
| S | 72472 | |
| i | 40156 | 3.8% |
| Other values (9) | 82826 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1060270 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 174250 | ||
| e | 121476 | |
| - | 121401 | |
| M | 118564 | |
| o | 85240 | |
| d | 81320 | |
| n | 81320 | |
| r | 81245 | |
| S | 72472 | |
| i | 40156 | 3.8% |
| Other values (9) | 82826 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1060270 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 174250 | ||
| e | 121476 | |
| - | 121401 | |
| M | 118564 | |
| o | 85240 | |
| d | 81320 | |
| n | 81320 | |
| r | 81245 | |
| S | 72472 | |
| i | 40156 | 3.8% |
| Other values (9) | 82826 |
BTHRM_STYLE3
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 145740 |
| Missing (%) | 80.0% |
| Memory size | 1.4 MiB |
| M - Modern | |
|---|---|
| S - Semi-Modern | |
| N - No Remodeling | |
| L - Luxury |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 12.302915 |
| Min length | 10 |
Characters and Unicode
| Total characters | 449081 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S - Semi-Modern |
|---|---|
| 2nd row | M - Modern |
| 3rd row | M - Modern |
| 4th row | S - Semi-Modern |
| 5th row | N - No Remodeling |
Common Values
| Value | Count | Frequency (%) |
| M - Modern | 18538 | 10.2% |
| S - Semi-Modern | 14078 | 7.7% |
| N - No Remodeling | 1953 | 1.1% |
| L - Luxury | 1933 | 1.1% |
| (Missing) | 145740 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 36502 | ||
| m | 18538 | |
| modern | 18538 | |
| s | 14078 | 12.6% |
| semi-modern | 14078 | 12.6% |
| n | 1953 | 1.8% |
| no | 1953 | 1.8% |
| remodeling | 1953 | 1.8% |
| l | 1933 | 1.7% |
| luxury | 1933 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 74957 | ||
| M | 51154 | |
| e | 50600 | |
| - | 50580 | |
| o | 36522 | |
| d | 34569 | |
| n | 34569 | |
| r | 34549 | |
| S | 28156 | 6.3% |
| i | 16031 | 3.6% |
| Other values (9) | 37394 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 449081 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 74957 | ||
| M | 51154 | |
| e | 50600 | |
| - | 50580 | |
| o | 36522 | |
| d | 34569 | |
| n | 34569 | |
| r | 34549 | |
| S | 28156 | 6.3% |
| i | 16031 | 3.6% |
| Other values (9) | 37394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 449081 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 74957 | ||
| M | 51154 | |
| e | 50600 | |
| - | 50580 | |
| o | 36522 | |
| d | 34569 | |
| n | 34569 | |
| r | 34549 | |
| S | 28156 | 6.3% |
| i | 16031 | 3.6% |
| Other values (9) | 37394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 449081 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 74957 | ||
| M | 51154 | |
| e | 50600 | |
| - | 50580 | |
| o | 36522 | |
| d | 34569 | |
| n | 34569 | |
| r | 34549 | |
| S | 28156 | 6.3% |
| i | 16031 | 3.6% |
| Other values (9) | 37394 |
KITCHEN_TYPE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49555 |
| Missing (%) | 27.2% |
| Memory size | 1.4 MiB |
| O - One Person | |
|---|---|
| 1F - 1 Full Eat In Kitchens | |
| F - Full Eat In | |
| 2F - 2 Full Eat In Kitchens | |
| 3F - 3 Full Eat In Kitchens | |
| Other values (5) |
Length
| Max length | 27 |
|---|---|
| Median length | 15 |
| Mean length | 20.065251 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2662398 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 3F - 3 Full Eat In Kitchens |
|---|---|
| 2nd row | 3F - 3 Full Eat In Kitchens |
| 3rd row | 3F - 3 Full Eat In Kitchens |
| 4th row | 3F - 3 Full Eat In Kitchens |
| 5th row | 2F - 2 Full Eat In Kitchens |
Common Values
| Value | Count | Frequency (%) |
| O - One Person | 44878 | |
| 1F - 1 Full Eat In Kitchens | 29285 | |
| F - Full Eat In | 24250 | |
| 2F - 2 Full Eat In Kitchens | 16818 | 9.2% |
| 3F - 3 Full Eat In Kitchens | 11996 | 6.6% |
| P - Pullman | 2722 | 1.5% |
| 0F - 0 Full Eat In Kitchens | 2585 | 1.4% |
| N - None | 115 | 0.1% |
| 4F - 4 Full Eat In Kitchens | 37 | < 0.1% |
| 5F - 5 Full Eat In Kitchens | 1 | < 0.1% |
| (Missing) | 49555 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 132687 | ||
| full | 84972 | |
| eat | 84972 | |
| in | 84972 | |
| kitchens | 60722 | |
| o | 44878 | 6.1% |
| one | 44878 | 6.1% |
| person | 44878 | 6.1% |
| 1f | 29285 | 4.0% |
| 1 | 29285 | 4.0% |
| Other values (15) | 92798 |
Most occurring characters
| Value | Count | Frequency (%) |
| 601640 | ||
| n | 238287 | 9.0% |
| l | 175388 | 6.6% |
| F | 169944 | 6.4% |
| e | 150593 | 5.7% |
| t | 145694 | 5.5% |
| - | 132687 | 5.0% |
| s | 105600 | 4.0% |
| O | 89756 | 3.4% |
| u | 87694 | 3.3% |
| Other values (18) | 765115 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2662398 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 601640 | ||
| n | 238287 | 9.0% |
| l | 175388 | 6.6% |
| F | 169944 | 6.4% |
| e | 150593 | 5.7% |
| t | 145694 | 5.5% |
| - | 132687 | 5.0% |
| s | 105600 | 4.0% |
| O | 89756 | 3.4% |
| u | 87694 | 3.3% |
| Other values (18) | 765115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2662398 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 601640 | ||
| n | 238287 | 9.0% |
| l | 175388 | 6.6% |
| F | 169944 | 6.4% |
| e | 150593 | 5.7% |
| t | 145694 | 5.5% |
| - | 132687 | 5.0% |
| s | 105600 | 4.0% |
| O | 89756 | 3.4% |
| u | 87694 | 3.3% |
| Other values (18) | 765115 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2662398 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 601640 | ||
| n | 238287 | 9.0% |
| l | 175388 | 6.6% |
| F | 169944 | 6.4% |
| e | 150593 | 5.7% |
| t | 145694 | 5.5% |
| - | 132687 | 5.0% |
| s | 105600 | 4.0% |
| O | 89756 | 3.4% |
| u | 87694 | 3.3% |
| Other values (18) | 765115 |
KITCHEN_STYLE1
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49549 |
| Missing (%) | 27.2% |
| Memory size | 1.4 MiB |
| M - Modern | |
|---|---|
| S - Semi-Modern | |
| L - Luxury | |
| N - No Remodeling | 5849 |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 12.357833 |
| Min length | 10 |
Characters and Unicode
| Total characters | 1639798 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S - Semi-Modern |
|---|---|
| 2nd row | M - Modern |
| 3rd row | S - Semi-Modern |
| 4th row | S - Semi-Modern |
| 5th row | N - No Remodeling |
Common Values
| Value | Count | Frequency (%) |
| M - Modern | 63927 | |
| S - Semi-Modern | 54385 | |
| L - Luxury | 8532 | 4.7% |
| N - No Remodeling | 5849 | 3.2% |
| (Missing) | 49549 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 132693 | ||
| m | 63927 | |
| modern | 63927 | |
| s | 54385 | |
| semi-modern | 54385 | |
| l | 8532 | 2.1% |
| luxury | 8532 | 2.1% |
| n | 5849 | 1.4% |
| no | 5849 | 1.4% |
| remodeling | 5849 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 271235 | ||
| - | 187078 | |
| e | 184395 | |
| M | 182239 | |
| o | 130010 | |
| r | 126844 | |
| d | 124161 | |
| n | 124161 | |
| S | 108770 | |
| i | 60234 | 3.7% |
| Other values (9) | 140671 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1639798 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 271235 | ||
| - | 187078 | |
| e | 184395 | |
| M | 182239 | |
| o | 130010 | |
| r | 126844 | |
| d | 124161 | |
| n | 124161 | |
| S | 108770 | |
| i | 60234 | 3.7% |
| Other values (9) | 140671 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1639798 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 271235 | ||
| - | 187078 | |
| e | 184395 | |
| M | 182239 | |
| o | 130010 | |
| r | 126844 | |
| d | 124161 | |
| n | 124161 | |
| S | 108770 | |
| i | 60234 | 3.7% |
| Other values (9) | 140671 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1639798 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 271235 | ||
| - | 187078 | |
| e | 184395 | |
| M | 182239 | |
| o | 130010 | |
| r | 126844 | |
| d | 124161 | |
| n | 124161 | |
| S | 108770 | |
| i | 60234 | 3.7% |
| Other values (9) | 140671 |
KITCHEN_STYLE2
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 150994 |
| Missing (%) | 82.9% |
| Memory size | 1.4 MiB |
| S - Semi-Modern | |
|---|---|
| M - Modern | |
| N - No Remodeling | |
| L - Luxury | 78 |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 13.551363 |
| Min length | 10 |
Characters and Unicode
| Total characters | 423453 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S - Semi-Modern |
|---|---|
| 2nd row | M - Modern |
| 3rd row | S - Semi-Modern |
| 4th row | S - Semi-Modern |
| 5th row | N - No Remodeling |
Common Values
| Value | Count | Frequency (%) |
| S - Semi-Modern | 18164 | 10.0% |
| M - Modern | 10127 | 5.6% |
| N - No Remodeling | 2879 | 1.6% |
| L - Luxury | 78 | < 0.1% |
| (Missing) | 150994 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 31248 | ||
| s | 18164 | |
| semi-modern | 18164 | |
| m | 10127 | 10.5% |
| modern | 10127 | 10.5% |
| n | 2879 | 3.0% |
| no | 2879 | 3.0% |
| remodeling | 2879 | 3.0% |
| l | 78 | 0.1% |
| luxury | 78 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 65375 | ||
| e | 52213 | |
| - | 49412 | |
| M | 38418 | |
| S | 36328 | |
| o | 34049 | |
| d | 31170 | |
| n | 31170 | |
| r | 28369 | |
| i | 21043 | 5.0% |
| Other values (9) | 35906 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 423453 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 65375 | ||
| e | 52213 | |
| - | 49412 | |
| M | 38418 | |
| S | 36328 | |
| o | 34049 | |
| d | 31170 | |
| n | 31170 | |
| r | 28369 | |
| i | 21043 | 5.0% |
| Other values (9) | 35906 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 423453 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 65375 | ||
| e | 52213 | |
| - | 49412 | |
| M | 38418 | |
| S | 36328 | |
| o | 34049 | |
| d | 31170 | |
| n | 31170 | |
| r | 28369 | |
| i | 21043 | 5.0% |
| Other values (9) | 35906 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 423453 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 65375 | ||
| e | 52213 | |
| - | 49412 | |
| M | 38418 | |
| S | 36328 | |
| o | 34049 | |
| d | 31170 | |
| n | 31170 | |
| r | 28369 | |
| i | 21043 | 5.0% |
| Other values (9) | 35906 |
KITCHEN_STYLE3
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 168497 |
| Missing (%) | 92.5% |
| Memory size | 1.4 MiB |
| S - Semi-Modern | |
|---|---|
| M - Modern | |
| N - No Remodeling | |
| L - Luxury | 29 |
Length
| Max length | 17 |
|---|---|
| Median length | 15 |
| Mean length | 13.525209 |
| Min length | 10 |
Characters and Unicode
| Total characters | 185904 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | S - Semi-Modern |
|---|---|
| 2nd row | M - Modern |
| 3rd row | S - Semi-Modern |
| 4th row | S - Semi-Modern |
| 5th row | M - Modern |
Common Values
| Value | Count | Frequency (%) |
| S - Semi-Modern | 7777 | 4.3% |
| M - Modern | 4572 | 2.5% |
| N - No Remodeling | 1367 | 0.8% |
| L - Luxury | 29 | < 0.1% |
| (Missing) | 168497 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 13745 | ||
| s | 7777 | |
| semi-modern | 7777 | |
| m | 4572 | 10.7% |
| modern | 4572 | 10.7% |
| n | 1367 | 3.2% |
| no | 1367 | 3.2% |
| remodeling | 1367 | 3.2% |
| l | 29 | 0.1% |
| luxury | 29 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 28857 | ||
| e | 22860 | |
| - | 21522 | |
| M | 16921 | |
| S | 15554 | |
| o | 15083 | |
| d | 13716 | |
| n | 13716 | |
| r | 12378 | |
| i | 9144 | 4.9% |
| Other values (9) | 16153 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 185904 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 28857 | ||
| e | 22860 | |
| - | 21522 | |
| M | 16921 | |
| S | 15554 | |
| o | 15083 | |
| d | 13716 | |
| n | 13716 | |
| r | 12378 | |
| i | 9144 | 4.9% |
| Other values (9) | 16153 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 185904 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 28857 | ||
| e | 22860 | |
| - | 21522 | |
| M | 16921 | |
| S | 15554 | |
| o | 15083 | |
| d | 13716 | |
| n | 13716 | |
| r | 12378 | |
| i | 9144 | 4.9% |
| Other values (9) | 16153 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 185904 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 28857 | ||
| e | 22860 | |
| - | 21522 | |
| M | 16921 | |
| S | 15554 | |
| o | 15083 | |
| d | 13716 | |
| n | 13716 | |
| r | 12378 | |
| i | 9144 | 4.9% |
| Other values (9) | 16153 |
HEAT_TYPE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48242 |
| Missing (%) | 26.5% |
| Memory size | 1.4 MiB |
| W - Ht Water/Steam | |
|---|---|
| F - Forced Hot Air | |
| E - Electric | 6023 |
| P - Heat Pump | 4654 |
| S - Space Heat | 652 |
| Other values (2) | 129 |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.527873 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2348735 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | W - Ht Water/Steam |
|---|---|
| 2nd row | F - Forced Hot Air |
| 3rd row | S - Space Heat |
| 4th row | W - Ht Water/Steam |
| 5th row | W - Ht Water/Steam |
Common Values
| Value | Count | Frequency (%) |
| W - Ht Water/Steam | 71163 | |
| F - Forced Hot Air | 51379 | |
| E - Electric | 6023 | 3.3% |
| P - Heat Pump | 4654 | 2.6% |
| S - Space Heat | 652 | 0.4% |
| N - None | 88 | < 0.1% |
| O - Other | 41 | < 0.1% |
| (Missing) | 48242 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 134000 | ||
| w | 71163 | |
| ht | 71163 | |
| water/steam | 71163 | |
| f | 51379 | 8.8% |
| forced | 51379 | 8.8% |
| hot | 51379 | 8.8% |
| air | 51379 | 8.8% |
| electric | 6023 | 1.0% |
| e | 6023 | 1.0% |
| Other values (9) | 16176 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 447227 | ||
| t | 276238 | |
| e | 205815 | |
| r | 179985 | 7.7% |
| a | 148284 | 6.3% |
| W | 142326 | 6.1% |
| - | 134000 | 5.7% |
| H | 127848 | 5.4% |
| o | 102846 | 4.4% |
| F | 102758 | 4.4% |
| Other values (16) | 481408 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2348735 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 447227 | ||
| t | 276238 | |
| e | 205815 | |
| r | 179985 | 7.7% |
| a | 148284 | 6.3% |
| W | 142326 | 6.1% |
| - | 134000 | 5.7% |
| H | 127848 | 5.4% |
| o | 102846 | 4.4% |
| F | 102758 | 4.4% |
| Other values (16) | 481408 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2348735 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 447227 | ||
| t | 276238 | |
| e | 205815 | |
| r | 179985 | 7.7% |
| a | 148284 | 6.3% |
| W | 142326 | 6.1% |
| - | 134000 | 5.7% |
| H | 127848 | 5.4% |
| o | 102846 | 4.4% |
| F | 102758 | 4.4% |
| Other values (16) | 481408 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2348735 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 447227 | ||
| t | 276238 | |
| e | 205815 | |
| r | 179985 | 7.7% |
| a | 148284 | 6.3% |
| W | 142326 | 6.1% |
| - | 134000 | 5.7% |
| H | 127848 | 5.4% |
| o | 102846 | 4.4% |
| F | 102758 | 4.4% |
| Other values (16) | 481408 |
HEAT_SYSTEM
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110013 |
| Missing (%) | 60.4% |
| Memory size | 1.4 MiB |
| I - Indiv. Cntrl | |
|---|---|
| Y - Self Contained | |
| C - Common | |
| N - None | 969 |
| 1 | 58 |
Length
| Max length | 18 |
|---|---|
| Median length | 16 |
| Mean length | 15.746196 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1137332 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | I - Indiv. Cntrl |
|---|---|
| 2nd row | I - Indiv. Cntrl |
| 3rd row | I - Indiv. Cntrl |
| 4th row | I - Indiv. Cntrl |
| 5th row | Y - Self Contained |
Common Values
| Value | Count | Frequency (%) |
| I - Indiv. Cntrl | 43889 | 24.1% |
| Y - Self Contained | 19271 | 10.6% |
| C - Common | 8042 | 4.4% |
| N - None | 969 | 0.5% |
| 1 | 58 | < 0.1% |
| (Missing) | 110013 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 72171 | ||
| i | 43889 | |
| indiv | 43889 | |
| cntrl | 43889 | |
| y | 19271 | 6.9% |
| self | 19271 | 6.9% |
| contained | 19271 | 6.9% |
| c | 8042 | 2.9% |
| common | 8042 | 2.9% |
| n | 969 | 0.3% |
| Other values (2) | 1027 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 207502 | ||
| n | 135331 | |
| I | 87778 | 7.7% |
| C | 79244 | 7.0% |
| - | 72171 | 6.3% |
| t | 63160 | 5.6% |
| d | 63160 | 5.6% |
| i | 63160 | 5.6% |
| l | 63160 | 5.6% |
| r | 43889 | 3.9% |
| Other values (11) | 258777 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1137332 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 207502 | ||
| n | 135331 | |
| I | 87778 | 7.7% |
| C | 79244 | 7.0% |
| - | 72171 | 6.3% |
| t | 63160 | 5.6% |
| d | 63160 | 5.6% |
| i | 63160 | 5.6% |
| l | 63160 | 5.6% |
| r | 43889 | 3.9% |
| Other values (11) | 258777 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1137332 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 207502 | ||
| n | 135331 | |
| I | 87778 | 7.7% |
| C | 79244 | 7.0% |
| - | 72171 | 6.3% |
| t | 63160 | 5.6% |
| d | 63160 | 5.6% |
| i | 63160 | 5.6% |
| l | 63160 | 5.6% |
| r | 43889 | 3.9% |
| Other values (11) | 258777 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1137332 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 207502 | ||
| n | 135331 | |
| I | 87778 | 7.7% |
| C | 79244 | 7.0% |
| - | 72171 | 6.3% |
| t | 63160 | 5.6% |
| d | 63160 | 5.6% |
| i | 63160 | 5.6% |
| l | 63160 | 5.6% |
| r | 43889 | 3.9% |
| Other values (11) | 258777 |
AC_TYPE
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48272 |
| Missing (%) | 26.5% |
| Memory size | 1.4 MiB |
| N - None | |
|---|---|
| C - Central AC | |
| D - Ductless AC | 1377 |
| Y - Yes | 1 |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 10.487572 |
| Min length | 7 |
Characters and Unicode
| Total characters | 1405020 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | N - None |
|---|---|
| 2nd row | C - Central AC |
| 3rd row | N - None |
| 4th row | N - None |
| 5th row | N - None |
Common Values
| Value | Count | Frequency (%) |
| N - None | 78655 | |
| C - Central AC | 53937 | |
| D - Ductless AC | 1377 | 0.8% |
| Y - Yes | 1 | < 0.1% |
| (Missing) | 48272 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 133970 | ||
| n | 78655 | |
| none | 78655 | |
| ac | 55314 | |
| c | 53937 | |
| central | 53937 | |
| d | 1377 | 0.3% |
| ductless | 1377 | 0.3% |
| y | 1 | < 0.1% |
| yes | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 323254 | ||
| C | 163188 | |
| N | 157310 | |
| - | 133970 | |
| e | 133970 | |
| n | 132592 | |
| o | 78655 | 5.6% |
| l | 55314 | 3.9% |
| t | 55314 | 3.9% |
| A | 55314 | 3.9% |
| Other values (7) | 116139 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1405020 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 323254 | ||
| C | 163188 | |
| N | 157310 | |
| - | 133970 | |
| e | 133970 | |
| n | 132592 | |
| o | 78655 | 5.6% |
| l | 55314 | 3.9% |
| t | 55314 | 3.9% |
| A | 55314 | 3.9% |
| Other values (7) | 116139 | 8.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1405020 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 323254 | ||
| C | 163188 | |
| N | 157310 | |
| - | 133970 | |
| e | 133970 | |
| n | 132592 | |
| o | 78655 | 5.6% |
| l | 55314 | 3.9% |
| t | 55314 | 3.9% |
| A | 55314 | 3.9% |
| Other values (7) | 116139 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1405020 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 323254 | ||
| C | 163188 | |
| N | 157310 | |
| - | 133970 | |
| e | 133970 | |
| n | 132592 | |
| o | 78655 | 5.6% |
| l | 55314 | 3.9% |
| t | 55314 | 3.9% |
| A | 55314 | 3.9% |
| Other values (7) | 116139 | 8.3% |
FIREPLACES
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 49534 |
| Missing (%) | 27.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.34416162 |
| Minimum | 0 |
|---|---|
| Maximum | 12 |
| Zeros | 96980 |
| Zeros (%) | 53.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.68600397 |
|---|---|
| Coefficient of variation (CV) | 1.9932611 |
| Kurtosis | 22.273269 |
| Mean | 0.34416162 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.4407165 |
| Sum | 45673 |
| Variance | 0.47060145 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 96980 | |
| 1 | 28983 | 15.9% |
| 2 | 5085 | 2.8% |
| 3 | 917 | 0.5% |
| 4 | 359 | 0.2% |
| 5 | 167 | 0.1% |
| 6 | 114 | 0.1% |
| 7 | 48 | < 0.1% |
| 8 | 34 | < 0.1% |
| 9 | 12 | < 0.1% |
| Other values (3) | 9 | < 0.1% |
| (Missing) | 49534 |
| Value | Count | Frequency (%) |
| 0 | 96980 | |
| 1 | 28983 | 15.9% |
| 2 | 5085 | 2.8% |
| 3 | 917 | 0.5% |
| 4 | 359 | 0.2% |
| 5 | 167 | 0.1% |
| 6 | 114 | 0.1% |
| 7 | 48 | < 0.1% |
| 8 | 34 | < 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 12 | 2 | < 0.1% |
| 11 | 4 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 12 | < 0.1% |
| 8 | 34 | < 0.1% |
| 7 | 48 | < 0.1% |
| 6 | 114 | 0.1% |
| 5 | 167 | 0.1% |
| 4 | 359 | 0.2% |
| 3 | 917 |
ORIENTATION
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110268 |
| Missing (%) | 60.5% |
| Memory size | 1.4 MiB |
| T - Through | |
|---|---|
| F - Front/Street | |
| A - Rear Above | |
| M - Middle | 2724 |
| C - Courtyard | 1761 |
| Other values (2) | 1513 |
Length
| Max length | 16 |
|---|---|
| Median length | 11 |
| Mean length | 12.732112 |
| Min length | 7 |
Characters and Unicode
| Total characters | 916381 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | T - Through |
|---|---|
| 2nd row | T - Through |
| 3rd row | T - Through |
| 4th row | T - Through |
| 5th row | T - Through |
Common Values
| Value | Count | Frequency (%) |
| T - Through | 37600 | 20.6% |
| F - Front/Street | 17990 | 9.9% |
| A - Rear Above | 10386 | 5.7% |
| M - Middle | 2724 | 1.5% |
| C - Courtyard | 1761 | 1.0% |
| B - Rear Below | 1259 | 0.7% |
| E - End | 254 | 0.1% |
| (Missing) | 110268 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 71974 | ||
| t | 37600 | |
| through | 37600 | |
| f | 17990 | 7.9% |
| front/street | 17990 | 7.9% |
| rear | 11645 | 5.1% |
| a | 10386 | 4.6% |
| above | 10386 | 4.6% |
| m | 2724 | 1.2% |
| middle | 2724 | 1.2% |
| Other values (6) | 6548 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 155593 | ||
| r | 88747 | |
| T | 75200 | |
| h | 75200 | |
| - | 71974 | 7.9% |
| o | 68996 | 7.5% |
| e | 61994 | 6.8% |
| t | 55731 | 6.1% |
| u | 39361 | 4.3% |
| g | 37600 | 4.1% |
| Other values (18) | 185985 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 916381 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 155593 | ||
| r | 88747 | |
| T | 75200 | |
| h | 75200 | |
| - | 71974 | 7.9% |
| o | 68996 | 7.5% |
| e | 61994 | 6.8% |
| t | 55731 | 6.1% |
| u | 39361 | 4.3% |
| g | 37600 | 4.1% |
| Other values (18) | 185985 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 916381 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 155593 | ||
| r | 88747 | |
| T | 75200 | |
| h | 75200 | |
| - | 71974 | 7.9% |
| o | 68996 | 7.5% |
| e | 61994 | 6.8% |
| t | 55731 | 6.1% |
| u | 39361 | 4.3% |
| g | 37600 | 4.1% |
| Other values (18) | 185985 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 916381 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 155593 | ||
| r | 88747 | |
| T | 75200 | |
| h | 75200 | |
| - | 71974 | 7.9% |
| o | 68996 | 7.5% |
| e | 61994 | 6.8% |
| t | 55731 | 6.1% |
| u | 39361 | 4.3% |
| g | 37600 | 4.1% |
| Other values (18) | 185985 |
NUM_PARKING
Real number (ℝ)
HIGH CORRELATION  MISSING  SKEWED  ZEROS 
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 48623 |
| Missing (%) | 26.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3289427 |
| Minimum | 0 |
|---|---|
| Maximum | 210 |
| Zeros | 58524 |
| Zeros (%) | 32.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 210 |
| Range | 210 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.411295 |
|---|---|
| Coefficient of variation (CV) | 1.8144461 |
| Kurtosis | 1663.0013 |
| Mean | 1.3289427 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 29.679935 |
| Sum | 177572 |
| Variance | 5.8143437 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 58524 | |
| 1 | 29227 | |
| 2 | 21830 | 12.0% |
| 3 | 8748 | 4.8% |
| 4 | 8222 | 4.5% |
| 6 | 3011 | 1.7% |
| 5 | 2636 | 1.4% |
| 8 | 631 | 0.3% |
| 7 | 554 | 0.3% |
| 10 | 95 | 0.1% |
| Other values (13) | 141 | 0.1% |
| (Missing) | 48623 |
| Value | Count | Frequency (%) |
| 0 | 58524 | |
| 1 | 29227 | |
| 2 | 21830 | 12.0% |
| 3 | 8748 | 4.8% |
| 4 | 8222 | 4.5% |
| 5 | 2636 | 1.4% |
| 6 | 3011 | 1.7% |
| 7 | 554 | 0.3% |
| 8 | 631 | 0.3% |
| 9 | 88 | < 0.1% |
| Value | Count | Frequency (%) |
| 210 | 1 | < 0.1% |
| 125 | 24 | |
| 56 | 1 | < 0.1% |
| 22 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 3 | < 0.1% |
| 14 | 5 | < 0.1% |
| 13 | 2 | < 0.1% |
PROP_VIEW
Categorical
HIGH CORRELATION  IMBALANCE  MISSING 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 46953 |
| Missing (%) | 25.8% |
| Memory size | 1.4 MiB |
| A - Average | |
|---|---|
| G - Good | |
| F - Fair | 6033 |
| E - Excellent | 4405 |
| P - Poor | 525 |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10.629519 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1438057 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A - Average |
|---|---|
| 2nd row | A - Average |
| 3rd row | A - Average |
| 4th row | A - Average |
| 5th row | A - Average |
Common Values
| Value | Count | Frequency (%) |
| A - Average | 110895 | |
| G - Good | 13086 | 7.2% |
| F - Fair | 6033 | 3.3% |
| E - Excellent | 4405 | 2.4% |
| P - Poor | 525 | 0.3% |
| S - Special | 345 | 0.2% |
| (Missing) | 46953 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 135289 | ||
| a | 110895 | |
| average | 110895 | |
| g | 13086 | 3.2% |
| good | 13086 | 3.2% |
| f | 6033 | 1.5% |
| fair | 6033 | 1.5% |
| e | 4405 | 1.1% |
| excellent | 4405 | 1.1% |
| p | 525 | 0.1% |
| Other values (3) | 1215 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 270578 | ||
| e | 230945 | |
| A | 221790 | |
| - | 135289 | |
| r | 117453 | |
| a | 117273 | |
| v | 110895 | |
| g | 110895 | |
| o | 27222 | 1.9% |
| G | 26172 | 1.8% |
| Other values (12) | 69545 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1438057 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 270578 | ||
| e | 230945 | |
| A | 221790 | |
| - | 135289 | |
| r | 117453 | |
| a | 117273 | |
| v | 110895 | |
| g | 110895 | |
| o | 27222 | 1.9% |
| G | 26172 | 1.8% |
| Other values (12) | 69545 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1438057 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 270578 | ||
| e | 230945 | |
| A | 221790 | |
| - | 135289 | |
| r | 117453 | |
| a | 117273 | |
| v | 110895 | |
| g | 110895 | |
| o | 27222 | 1.9% |
| G | 26172 | 1.8% |
| Other values (12) | 69545 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1438057 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 270578 | ||
| e | 230945 | |
| A | 221790 | |
| - | 135289 | |
| r | 117453 | |
| a | 117273 | |
| v | 110895 | |
| g | 110895 | |
| o | 27222 | 1.9% |
| G | 26172 | 1.8% |
| Other values (12) | 69545 | 4.8% |
CORNER_UNIT
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 110271 |
| Missing (%) | 60.5% |
| Memory size | 1.4 MiB |
| N - No | |
|---|---|
| Y - Yes |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 6.1921746 |
| Min length | 6 |
Characters and Unicode
| Total characters | 445657 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | N - No |
|---|---|
| 2nd row | N - No |
| 3rd row | N - No |
| 4th row | N - No |
| 5th row | N - No |
Common Values
| Value | Count | Frequency (%) |
| N - No | 58140 | |
| Y - Yes | 13831 | 7.6% |
| (Missing) | 110271 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 71971 | ||
| n | 58140 | |
| no | 58140 | |
| y | 13831 | 6.4% |
| yes | 13831 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 143942 | ||
| N | 116280 | |
| - | 71971 | |
| o | 58140 | |
| Y | 27662 | 6.2% |
| e | 13831 | 3.1% |
| s | 13831 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 445657 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 143942 | ||
| N | 116280 | |
| - | 71971 | |
| o | 58140 | |
| Y | 27662 | 6.2% |
| e | 13831 | 3.1% |
| s | 13831 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 445657 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 143942 | ||
| N | 116280 | |
| - | 71971 | |
| o | 58140 | |
| Y | 27662 | 6.2% |
| e | 13831 | 3.1% |
| s | 13831 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 445657 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 143942 | ||
| N | 116280 | |
| - | 71971 | |
| o | 58140 | |
| Y | 27662 | 6.2% |
| e | 13831 | 3.1% |
| s | 13831 | 3.1% |
| AC_TYPE | BDRM_COND | BED_RMS | BTHRM_STYLE1 | BTHRM_STYLE2 | BTHRM_STYLE3 | CD_FLOOR | CITY | CM_ID | COM_UNITS | CORNER_UNIT | EXT_COND | EXT_FNISHED | FIREPLACES | FULL_BTH | GIS_ID | GROSS_AREA | HEAT_SYSTEM | HEAT_TYPE | HLF_BTH | INT_COND | INT_WALL | KITCHENS | KITCHEN_STYLE1 | KITCHEN_STYLE2 | KITCHEN_STYLE3 | KITCHEN_TYPE | LIVING_AREA | LU | LUC | NUM_BLDGS | NUM_PARKING | ORIENTATION | OVERALL_COND | OWN_OCC | PID | PROP_VIEW | RC_UNITS | RES_FLOOR | RES_UNITS | ROOF_COVER | ROOF_STRUCTURE | STRUCTURE_CLASS | ST_NUM | TT_RMS | YR_BUILT | YR_REMODEL | ZIP_CODE | _id | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AC_TYPE | 1.000 | 0.089 | 0.182 | 0.277 | 0.359 | 0.374 | 0.143 | 0.208 | 0.190 | 1.000 | 0.048 | 0.372 | 0.304 | 0.036 | 0.069 | 0.203 | 1.000 | 0.193 | 0.342 | 0.070 | 0.319 | 0.116 | 0.242 | 0.275 | 0.250 | 0.251 | 0.266 | 1.000 | 0.245 | 0.016 | 0.005 | 0.000 | 0.077 | 0.208 | 0.030 | 0.203 | 0.171 | 1.000 | 0.000 | 1.000 | 0.220 | 0.174 | 0.146 | 0.032 | 0.195 | 0.000 | 0.006 | 0.119 | 0.213 |
| BDRM_COND | 0.089 | 1.000 | 0.026 | 0.191 | 0.214 | 0.212 | 0.247 | 0.148 | 0.127 | 0.000 | 0.142 | 0.162 | 0.276 | 0.051 | 0.064 | 0.127 | 1.000 | 0.072 | 0.101 | 0.069 | 0.145 | 0.189 | 0.000 | 0.188 | 0.479 | 1.000 | 0.094 | 1.000 | 0.004 | 0.009 | 0.000 | 1.000 | 0.101 | 0.181 | 0.074 | 0.127 | 0.188 | 0.000 | 0.022 | 0.000 | 0.089 | 0.055 | 0.364 | 0.035 | 0.127 | 0.000 | 0.000 | 0.127 | 0.155 |
| BED_RMS | 0.182 | 0.026 | 1.000 | 0.107 | 0.190 | 0.237 | -0.189 | 0.206 | 0.150 | NaN | 0.027 | 0.158 | 0.200 | 0.034 | 0.659 | 0.261 | 0.898 | 0.051 | 0.108 | 0.171 | 0.142 | 0.044 | 0.696 | 0.113 | 0.062 | 0.083 | 0.355 | 0.873 | 0.351 | 0.282 | 0.003 | 0.449 | 0.125 | 0.044 | 0.239 | 0.261 | 0.100 | NaN | 0.742 | NaN | 0.211 | 0.221 | 0.138 | -0.146 | 0.940 | -0.147 | 0.277 | 0.123 | 0.261 |
| BTHRM_STYLE1 | 0.277 | 0.191 | 0.107 | 1.000 | 0.756 | 0.668 | 0.231 | 0.234 | 0.195 | 0.000 | 0.119 | 0.350 | 0.338 | 0.080 | 0.057 | 0.195 | 1.000 | 0.156 | 0.218 | 0.088 | 0.525 | 0.355 | 0.118 | 0.769 | 0.542 | 0.552 | 0.177 | 1.000 | 0.168 | 1.000 | 0.015 | 0.012 | 0.125 | 0.330 | 0.057 | 0.195 | 0.228 | 0.000 | 0.012 | 0.000 | 0.198 | 0.138 | 0.307 | 0.060 | 0.122 | 0.000 | 0.006 | 0.159 | 0.218 |
| BTHRM_STYLE2 | 0.359 | 0.214 | 0.190 | 0.756 | 1.000 | 0.820 | 0.265 | 0.279 | 0.183 | 0.000 | 0.145 | 0.374 | 0.372 | 0.038 | 0.063 | 0.218 | 1.000 | 0.124 | 0.220 | 0.055 | 0.499 | 0.351 | 0.185 | 0.658 | 0.678 | 0.638 | 0.261 | 1.000 | 0.260 | 1.000 | 0.021 | 0.000 | 0.146 | 0.325 | 0.084 | 0.218 | 0.252 | 0.000 | 0.012 | 0.000 | 0.215 | 0.187 | 0.313 | 0.039 | 0.207 | 0.000 | 0.000 | 0.209 | 0.258 |
| BTHRM_STYLE3 | 0.374 | 0.212 | 0.237 | 0.668 | 0.820 | 1.000 | 0.294 | 0.298 | 0.223 | 0.000 | 0.216 | 0.407 | 0.385 | 0.054 | 0.130 | 0.209 | 1.000 | 0.126 | 0.216 | 0.174 | 0.506 | 0.357 | 0.255 | 0.613 | 0.589 | 0.713 | 0.309 | 1.000 | 0.305 | 1.000 | 0.034 | 0.000 | 0.208 | 0.346 | 0.080 | 0.209 | 0.294 | 0.000 | 0.014 | 0.000 | 0.195 | 0.138 | 0.413 | 0.040 | 0.271 | 1.000 | 0.008 | 0.239 | 0.273 |
| CD_FLOOR | 0.143 | 0.247 | -0.189 | 0.231 | 0.265 | 0.294 | 1.000 | 0.114 | -0.260 | NaN | 0.169 | 0.205 | 0.211 | -0.103 | 0.032 | -0.260 | -0.061 | 0.213 | 0.061 | -0.077 | 0.162 | 0.315 | -0.010 | 0.220 | 1.000 | 1.000 | 0.079 | -0.051 | 1.000 | NaN | 0.004 | -0.102 | 0.121 | 0.169 | 0.097 | -0.260 | 0.403 | NaN | -0.382 | NaN | 0.181 | 0.124 | 0.389 | 0.021 | -0.216 | 0.184 | -0.070 | -0.232 | -0.260 |
| CITY | 0.208 | 0.148 | 0.206 | 0.234 | 0.279 | 0.298 | 0.114 | 1.000 | 0.660 | 0.000 | 0.150 | 0.195 | 0.196 | 0.045 | 0.078 | 0.681 | 0.010 | 0.154 | 0.137 | 0.074 | 0.217 | 0.166 | 0.183 | 0.229 | 0.126 | 0.099 | 0.252 | 0.010 | 0.175 | 0.072 | 0.006 | 0.026 | 0.169 | 0.080 | 0.261 | 0.681 | 0.177 | 0.000 | 0.021 | 0.084 | 0.265 | 0.265 | 0.259 | 0.176 | 0.220 | 0.000 | 0.021 | 0.694 | 0.752 |
| CM_ID | 0.190 | 0.127 | 0.150 | 0.195 | 0.183 | 0.223 | -0.260 | 0.660 | 1.000 | -0.125 | 0.143 | 0.147 | 0.227 | -0.104 | 0.011 | 1.000 | -0.008 | 0.162 | 0.145 | -0.014 | 0.194 | 0.119 | 0.135 | 0.195 | 0.561 | 0.217 | 0.140 | -0.014 | 0.078 | -0.048 | 0.000 | 0.230 | 0.135 | 0.099 | 0.153 | 1.000 | 0.152 | -0.004 | 0.028 | -0.169 | 0.237 | 0.219 | 0.341 | 0.066 | 0.135 | 0.101 | -0.023 | 0.582 | 1.000 |
| COM_UNITS | 1.000 | 0.000 | NaN | 0.000 | 0.000 | 0.000 | NaN | 0.000 | -0.125 | 1.000 | 0.000 | 0.016 | 0.000 | NaN | NaN | -0.125 | 0.307 | 1.000 | 1.000 | NaN | 0.000 | 0.000 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | NaN | 1.000 | NaN | 0.000 | NaN | 0.000 | 0.000 | 1.000 | -0.125 | 1.000 | 0.093 | 0.248 | 0.073 | 0.000 | 0.000 | 1.000 | 0.145 | NaN | 0.105 | NaN | -0.137 | -0.125 |
| CORNER_UNIT | 0.048 | 0.142 | 0.027 | 0.119 | 0.145 | 0.216 | 0.169 | 0.150 | 0.143 | 0.000 | 1.000 | 0.136 | 0.208 | 0.044 | 0.050 | 0.143 | 1.000 | 0.063 | 0.034 | 0.042 | 0.127 | 0.094 | 0.000 | 0.112 | 0.410 | 0.524 | 0.074 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 0.295 | 0.098 | 0.036 | 0.143 | 0.169 | 0.000 | 0.000 | 0.000 | 0.144 | 0.146 | 0.046 | 0.062 | 0.085 | 0.000 | 0.000 | 0.101 | 0.169 |
| EXT_COND | 0.372 | 0.162 | 0.158 | 0.350 | 0.374 | 0.407 | 0.205 | 0.195 | 0.147 | 0.016 | 0.136 | 1.000 | 0.385 | 0.038 | 0.058 | 0.184 | 0.000 | 0.095 | 0.209 | 0.038 | 0.428 | 0.274 | 0.163 | 0.343 | 0.222 | 0.211 | 0.227 | 0.000 | 0.233 | 0.047 | 0.011 | 0.037 | 0.127 | 0.398 | 0.070 | 0.184 | 0.205 | 0.000 | 0.030 | 0.077 | 0.219 | 0.170 | 0.284 | 0.044 | 0.166 | 0.009 | 0.000 | 0.099 | 0.193 |
| EXT_FNISHED | 0.304 | 0.276 | 0.200 | 0.338 | 0.372 | 0.385 | 0.211 | 0.196 | 0.227 | 0.000 | 0.208 | 0.385 | 1.000 | 0.035 | 0.087 | 0.229 | 0.125 | 0.178 | 0.155 | 0.093 | 0.305 | 0.354 | 0.463 | 0.329 | 0.145 | 0.136 | 0.236 | 0.130 | 0.331 | 0.344 | 0.000 | 0.584 | 0.213 | 0.203 | 0.334 | 0.229 | 0.211 | 0.000 | 0.168 | 0.131 | 0.303 | 0.259 | 0.596 | 0.075 | 0.218 | 0.000 | 0.036 | 0.218 | 0.244 |
| FIREPLACES | 0.036 | 0.051 | 0.034 | 0.080 | 0.038 | 0.054 | -0.103 | 0.045 | -0.104 | NaN | 0.044 | 0.038 | 0.035 | 1.000 | 0.044 | 0.018 | 0.103 | 0.032 | 0.021 | 0.253 | 0.048 | 0.129 | -0.181 | 0.087 | 0.123 | 0.117 | 0.044 | 0.103 | 0.070 | -0.283 | 0.002 | 0.125 | 0.049 | 0.091 | 0.038 | 0.018 | 0.029 | NaN | 0.057 | NaN | 0.082 | 0.061 | 0.099 | 0.000 | 0.054 | -0.062 | 0.104 | -0.011 | 0.018 |
| FULL_BTH | 0.069 | 0.064 | 0.659 | 0.057 | 0.063 | 0.130 | 0.032 | 0.078 | 0.011 | NaN | 0.050 | 0.058 | 0.087 | 0.044 | 1.000 | 0.062 | 0.311 | 0.024 | 0.055 | 0.193 | 0.047 | 0.044 | 0.859 | 0.058 | 0.150 | 0.177 | 0.245 | 0.343 | 0.250 | -0.256 | 0.017 | 0.233 | 0.030 | 0.047 | 0.045 | 0.062 | 0.040 | NaN | 0.433 | NaN | 0.019 | 0.011 | 0.065 | -0.112 | 0.664 | -0.060 | 0.233 | 0.052 | 0.062 |
| GIS_ID | 0.203 | 0.127 | 0.261 | 0.195 | 0.218 | 0.209 | -0.260 | 0.681 | 1.000 | -0.125 | 0.143 | 0.184 | 0.229 | 0.018 | 0.062 | 1.000 | 0.194 | 0.162 | 0.132 | 0.081 | 0.202 | 0.145 | 0.126 | 0.191 | 0.133 | 0.094 | 0.208 | 0.116 | 0.191 | -0.125 | 0.002 | 0.438 | 0.135 | 0.070 | 0.197 | 1.000 | 0.161 | -0.004 | 0.034 | -0.169 | 0.262 | 0.258 | 0.247 | -0.058 | 0.255 | 0.150 | -0.023 | 0.608 | 1.000 |
| GROSS_AREA | 1.000 | 1.000 | 0.898 | 1.000 | 1.000 | 1.000 | -0.061 | 0.010 | -0.008 | 0.307 | 1.000 | 0.000 | 0.125 | 0.103 | 0.311 | 0.194 | 1.000 | 1.000 | 1.000 | 0.141 | 1.000 | 1.000 | 0.245 | 1.000 | 1.000 | 1.000 | 1.000 | 0.970 | 0.052 | 0.303 | 0.000 | 0.519 | 1.000 | 0.051 | 0.025 | 0.194 | 0.006 | 0.143 | 0.775 | 0.252 | 0.002 | 0.000 | 0.102 | -0.089 | 0.925 | -0.104 | 0.243 | 0.070 | 0.194 |
| HEAT_SYSTEM | 0.193 | 0.072 | 0.051 | 0.156 | 0.124 | 0.126 | 0.213 | 0.154 | 0.162 | 1.000 | 0.063 | 0.095 | 0.178 | 0.032 | 0.024 | 0.162 | 1.000 | 1.000 | 0.232 | 0.063 | 0.109 | 0.102 | 0.000 | 0.137 | 0.234 | 0.000 | 0.041 | 1.000 | 0.020 | 0.020 | 0.013 | 1.000 | 0.090 | 0.116 | 0.122 | 0.162 | 0.083 | 1.000 | 0.000 | 1.000 | 0.166 | 0.077 | 0.089 | 0.071 | 0.084 | 0.000 | 0.000 | 0.067 | 0.176 |
| HEAT_TYPE | 0.342 | 0.101 | 0.108 | 0.218 | 0.220 | 0.216 | 0.061 | 0.137 | 0.145 | 1.000 | 0.034 | 0.209 | 0.155 | 0.021 | 0.055 | 0.132 | 1.000 | 0.232 | 1.000 | 0.035 | 0.212 | 0.088 | 0.111 | 0.212 | 0.101 | 0.099 | 0.155 | 1.000 | 0.143 | 0.017 | 0.000 | 0.000 | 0.056 | 0.140 | 0.065 | 0.132 | 0.099 | 1.000 | 0.005 | 1.000 | 0.139 | 0.089 | 0.374 | 0.035 | 0.124 | 0.000 | 0.000 | 0.083 | 0.131 |
| HLF_BTH | 0.070 | 0.069 | 0.171 | 0.088 | 0.055 | 0.174 | -0.077 | 0.074 | -0.014 | NaN | 0.042 | 0.038 | 0.093 | 0.253 | 0.193 | 0.081 | 0.141 | 0.063 | 0.035 | 1.000 | 0.050 | 0.081 | 0.136 | 0.089 | 0.115 | 0.155 | 0.155 | 0.143 | 0.180 | -0.371 | 0.031 | 0.220 | 0.056 | 0.069 | 0.268 | 0.081 | 0.046 | NaN | 0.220 | NaN | 0.109 | 0.117 | 0.103 | -0.094 | 0.178 | 0.091 | 0.190 | 0.068 | 0.081 |
| INT_COND | 0.319 | 0.145 | 0.142 | 0.525 | 0.499 | 0.506 | 0.162 | 0.217 | 0.194 | 0.000 | 0.127 | 0.428 | 0.305 | 0.048 | 0.047 | 0.202 | 1.000 | 0.109 | 0.212 | 0.050 | 1.000 | 0.294 | 0.142 | 0.533 | 0.412 | 0.418 | 0.215 | 1.000 | 0.202 | 0.012 | 0.009 | 0.000 | 0.116 | 0.478 | 0.044 | 0.202 | 0.194 | 0.000 | 0.009 | 0.000 | 0.198 | 0.160 | 0.178 | 0.046 | 0.152 | 0.006 | 0.012 | 0.121 | 0.215 |
| INT_WALL | 0.116 | 0.189 | 0.044 | 0.355 | 0.351 | 0.357 | 0.315 | 0.166 | 0.119 | 0.000 | 0.094 | 0.274 | 0.354 | 0.129 | 0.044 | 0.145 | 1.000 | 0.102 | 0.088 | 0.081 | 0.294 | 1.000 | 0.099 | 0.366 | 0.168 | 0.155 | 0.099 | 1.000 | 0.229 | 0.144 | 0.016 | 0.000 | 0.104 | 0.260 | 0.057 | 0.145 | 0.223 | 0.000 | 0.008 | 0.000 | 0.165 | 0.115 | 0.338 | 0.028 | 0.041 | 0.000 | 0.000 | 0.118 | 0.166 |
| KITCHENS | 0.242 | 0.000 | 0.696 | 0.118 | 0.185 | 0.255 | -0.010 | 0.183 | 0.135 | NaN | 0.000 | 0.163 | 0.463 | -0.181 | 0.859 | 0.126 | 0.245 | 0.000 | 0.111 | 0.136 | 0.142 | 0.099 | 1.000 | 0.133 | 0.034 | 0.048 | 0.777 | 0.222 | 0.752 | -0.210 | 0.007 | 0.223 | 0.000 | 0.061 | 0.475 | 0.126 | 0.078 | NaN | 0.417 | NaN | 0.179 | 0.172 | 0.225 | -0.125 | 0.721 | -0.165 | 0.066 | 0.097 | 0.126 |
| KITCHEN_STYLE1 | 0.275 | 0.188 | 0.113 | 0.769 | 0.658 | 0.613 | 0.220 | 0.229 | 0.195 | 0.000 | 0.112 | 0.343 | 0.329 | 0.087 | 0.058 | 0.191 | 1.000 | 0.137 | 0.212 | 0.089 | 0.533 | 0.366 | 0.133 | 1.000 | 0.638 | 0.646 | 0.192 | 1.000 | 0.172 | 1.000 | 0.014 | 0.000 | 0.125 | 0.319 | 0.057 | 0.191 | 0.215 | 0.000 | 0.011 | 0.000 | 0.191 | 0.136 | 0.298 | 0.051 | 0.129 | 0.000 | 0.006 | 0.156 | 0.218 |
| KITCHEN_STYLE2 | 0.250 | 0.479 | 0.062 | 0.542 | 0.678 | 0.589 | 1.000 | 0.126 | 0.561 | 0.000 | 0.410 | 0.222 | 0.145 | 0.123 | 0.150 | 0.133 | 1.000 | 0.234 | 0.101 | 0.115 | 0.412 | 0.168 | 0.034 | 0.638 | 1.000 | 0.812 | 0.087 | 1.000 | 0.068 | 1.000 | 0.006 | 0.015 | 0.283 | 0.358 | 0.101 | 0.133 | 0.061 | 0.000 | 1.000 | 0.000 | 0.088 | 0.066 | 0.128 | 0.007 | 0.029 | 1.000 | 0.065 | 0.066 | 0.138 |
| KITCHEN_STYLE3 | 0.251 | 1.000 | 0.083 | 0.552 | 0.638 | 0.713 | 1.000 | 0.099 | 0.217 | 0.000 | 0.524 | 0.211 | 0.136 | 0.117 | 0.177 | 0.094 | 1.000 | 0.000 | 0.099 | 0.155 | 0.418 | 0.155 | 0.048 | 0.646 | 0.812 | 1.000 | 0.095 | 1.000 | 0.083 | 1.000 | 0.009 | 0.021 | 1.000 | 0.357 | 0.148 | 0.094 | 0.064 | 0.000 | 1.000 | 0.000 | 0.090 | 0.023 | 0.030 | 0.016 | 0.050 | 1.000 | 0.000 | 0.041 | 0.105 |
| KITCHEN_TYPE | 0.266 | 0.094 | 0.355 | 0.177 | 0.261 | 0.309 | 0.079 | 0.252 | 0.140 | 0.000 | 0.074 | 0.227 | 0.236 | 0.044 | 0.245 | 0.208 | 1.000 | 0.041 | 0.155 | 0.155 | 0.215 | 0.099 | 0.777 | 0.192 | 0.087 | 0.095 | 1.000 | 1.000 | 0.950 | 1.000 | 0.003 | 0.003 | 0.140 | 0.076 | 0.304 | 0.208 | 0.141 | 0.000 | 0.000 | 0.000 | 0.272 | 0.289 | 0.172 | 0.049 | 0.430 | 0.000 | 0.008 | 0.166 | 0.228 |
| LIVING_AREA | 1.000 | 1.000 | 0.873 | 1.000 | 1.000 | 1.000 | -0.051 | 0.010 | -0.014 | NaN | 1.000 | 0.000 | 0.130 | 0.103 | 0.343 | 0.116 | 0.970 | 1.000 | 1.000 | 0.143 | 1.000 | 1.000 | 0.222 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.049 | 0.369 | 0.000 | 0.449 | 1.000 | 0.047 | 0.023 | 0.116 | 0.006 | NaN | 0.790 | NaN | 0.000 | 0.000 | 0.096 | -0.065 | 0.900 | -0.104 | 0.248 | 0.004 | 0.116 |
| LU | 0.245 | 0.004 | 0.351 | 0.168 | 0.260 | 0.305 | 1.000 | 0.175 | 0.078 | 1.000 | 1.000 | 0.233 | 0.331 | 0.070 | 0.250 | 0.191 | 0.052 | 0.020 | 0.143 | 0.180 | 0.202 | 0.229 | 0.752 | 0.172 | 0.068 | 0.083 | 0.950 | 0.049 | 1.000 | 0.838 | 0.012 | 0.330 | 1.000 | 0.095 | 0.570 | 0.191 | 0.122 | 1.000 | 0.057 | 1.000 | 0.268 | 0.279 | 0.307 | 0.067 | 0.408 | 0.000 | 0.033 | 0.140 | 0.211 |
| LUC | 0.016 | 0.009 | 0.282 | 1.000 | 1.000 | 1.000 | NaN | 0.072 | -0.048 | NaN | 1.000 | 0.047 | 0.344 | -0.283 | -0.256 | -0.125 | 0.303 | 0.020 | 0.017 | -0.371 | 0.012 | 0.144 | -0.210 | 1.000 | 1.000 | 1.000 | 1.000 | 0.369 | 0.838 | 1.000 | 0.005 | -0.083 | 1.000 | 0.054 | 0.416 | -0.125 | 0.016 | NaN | 0.459 | NaN | 0.038 | 0.044 | 0.191 | 0.073 | 0.294 | -0.045 | -0.038 | -0.134 | -0.125 |
| NUM_BLDGS | 0.005 | 0.000 | 0.003 | 0.015 | 0.021 | 0.034 | 0.004 | 0.006 | 0.000 | 0.000 | 0.000 | 0.011 | 0.000 | 0.002 | 0.017 | 0.002 | 0.000 | 0.013 | 0.000 | 0.031 | 0.009 | 0.016 | 0.007 | 0.014 | 0.006 | 0.009 | 0.003 | 0.000 | 0.012 | 0.005 | 1.000 | 0.000 | 0.000 | 0.012 | 0.004 | 0.002 | 0.005 | 0.000 | 0.000 | 0.000 | 0.003 | 0.000 | 0.000 | 0.000 | 0.003 | 0.000 | 0.000 | 0.003 | 0.004 |
| NUM_PARKING | 0.000 | 1.000 | 0.449 | 0.012 | 0.000 | 0.000 | -0.102 | 0.026 | 0.230 | NaN | 1.000 | 0.037 | 0.584 | 0.125 | 0.233 | 0.438 | 0.519 | 1.000 | 0.000 | 0.220 | 0.000 | 0.000 | 0.223 | 0.000 | 0.015 | 0.021 | 0.003 | 0.449 | 0.330 | -0.083 | 0.000 | 1.000 | 1.000 | 0.016 | 0.016 | 0.438 | 0.000 | NaN | 0.311 | NaN | 0.013 | 0.010 | 0.039 | -0.103 | 0.457 | 0.142 | 0.166 | 0.286 | 0.438 |
| ORIENTATION | 0.077 | 0.101 | 0.125 | 0.125 | 0.146 | 0.208 | 0.121 | 0.169 | 0.135 | 0.000 | 0.295 | 0.127 | 0.213 | 0.049 | 0.030 | 0.135 | 1.000 | 0.090 | 0.056 | 0.056 | 0.116 | 0.104 | 0.000 | 0.125 | 0.283 | 1.000 | 0.140 | 1.000 | 1.000 | 1.000 | 0.000 | 1.000 | 1.000 | 0.073 | 0.168 | 0.135 | 0.162 | 0.000 | 0.031 | 0.000 | 0.160 | 0.149 | 0.370 | 0.088 | 0.172 | 0.000 | 0.000 | 0.157 | 0.162 |
| OVERALL_COND | 0.208 | 0.181 | 0.044 | 0.330 | 0.325 | 0.346 | 0.169 | 0.080 | 0.099 | 0.000 | 0.098 | 0.398 | 0.203 | 0.091 | 0.047 | 0.070 | 0.051 | 0.116 | 0.140 | 0.069 | 0.478 | 0.260 | 0.061 | 0.319 | 0.358 | 0.357 | 0.076 | 0.047 | 0.095 | 0.054 | 0.012 | 0.016 | 0.073 | 1.000 | 0.058 | 0.070 | 0.139 | 0.000 | 0.031 | 0.059 | 0.170 | 0.124 | 0.286 | 0.021 | 0.055 | 0.000 | 0.034 | 0.091 | 0.076 |
| OWN_OCC | 0.030 | 0.074 | 0.239 | 0.057 | 0.084 | 0.080 | 0.097 | 0.261 | 0.153 | 1.000 | 0.036 | 0.070 | 0.334 | 0.038 | 0.045 | 0.197 | 0.025 | 0.122 | 0.065 | 0.268 | 0.044 | 0.057 | 0.475 | 0.057 | 0.101 | 0.148 | 0.304 | 0.023 | 0.570 | 0.416 | 0.004 | 0.016 | 0.168 | 0.058 | 1.000 | 0.197 | 0.091 | 1.000 | 0.045 | 1.000 | 0.260 | 0.265 | 0.199 | 0.086 | 0.292 | 0.000 | 0.013 | 0.127 | 0.255 |
| PID | 0.203 | 0.127 | 0.261 | 0.195 | 0.218 | 0.209 | -0.260 | 0.681 | 1.000 | -0.125 | 0.143 | 0.184 | 0.229 | 0.018 | 0.062 | 1.000 | 0.194 | 0.162 | 0.132 | 0.081 | 0.202 | 0.145 | 0.126 | 0.191 | 0.133 | 0.094 | 0.208 | 0.116 | 0.191 | -0.125 | 0.002 | 0.438 | 0.135 | 0.070 | 0.197 | 1.000 | 0.161 | -0.004 | 0.034 | -0.169 | 0.262 | 0.258 | 0.247 | -0.058 | 0.255 | 0.150 | -0.023 | 0.608 | 1.000 |
| PROP_VIEW | 0.171 | 0.188 | 0.100 | 0.228 | 0.252 | 0.294 | 0.403 | 0.177 | 0.152 | 1.000 | 0.169 | 0.205 | 0.211 | 0.029 | 0.040 | 0.161 | 0.006 | 0.083 | 0.099 | 0.046 | 0.194 | 0.223 | 0.078 | 0.215 | 0.061 | 0.064 | 0.141 | 0.006 | 0.122 | 0.016 | 0.005 | 0.000 | 0.162 | 0.139 | 0.091 | 0.161 | 1.000 | 1.000 | 0.010 | 0.077 | 0.157 | 0.131 | 0.324 | 0.029 | 0.105 | 0.000 | 0.000 | 0.155 | 0.185 |
| RC_UNITS | 1.000 | 0.000 | NaN | 0.000 | 0.000 | 0.000 | NaN | 0.000 | -0.004 | 0.093 | 0.000 | 0.000 | 0.000 | NaN | NaN | -0.004 | 0.143 | 1.000 | 1.000 | NaN | 0.000 | 0.000 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | NaN | 1.000 | NaN | 0.000 | NaN | 0.000 | 0.000 | 1.000 | -0.004 | 1.000 | 1.000 | 0.075 | -0.035 | 0.000 | 0.000 | 1.000 | 0.055 | NaN | 0.043 | NaN | -0.008 | -0.004 |
| RES_FLOOR | 0.000 | 0.022 | 0.742 | 0.012 | 0.012 | 0.014 | -0.382 | 0.021 | 0.028 | 0.248 | 0.000 | 0.030 | 0.168 | 0.057 | 0.433 | 0.034 | 0.775 | 0.000 | 0.005 | 0.220 | 0.009 | 0.008 | 0.417 | 0.011 | 1.000 | 1.000 | 0.000 | 0.790 | 0.057 | 0.459 | 0.000 | 0.311 | 0.031 | 0.031 | 0.045 | 0.034 | 0.010 | 0.075 | 1.000 | 0.473 | 0.021 | 0.017 | 0.122 | -0.088 | 0.767 | -0.161 | 0.197 | -0.024 | 0.034 |
| RES_UNITS | 1.000 | 0.000 | NaN | 0.000 | 0.000 | 0.000 | NaN | 0.084 | -0.169 | 0.073 | 0.000 | 0.077 | 0.131 | NaN | NaN | -0.169 | 0.252 | 1.000 | 1.000 | NaN | 0.000 | 0.000 | NaN | 0.000 | 0.000 | 0.000 | 0.000 | NaN | 1.000 | NaN | 0.000 | NaN | 0.000 | 0.059 | 1.000 | -0.169 | 0.077 | -0.035 | 0.473 | 1.000 | 0.059 | 0.038 | 0.205 | 0.120 | NaN | 0.308 | NaN | -0.240 | -0.169 |
| ROOF_COVER | 0.220 | 0.089 | 0.211 | 0.198 | 0.215 | 0.195 | 0.181 | 0.265 | 0.237 | 0.000 | 0.144 | 0.219 | 0.303 | 0.082 | 0.019 | 0.262 | 0.002 | 0.166 | 0.139 | 0.109 | 0.198 | 0.165 | 0.179 | 0.191 | 0.088 | 0.090 | 0.272 | 0.000 | 0.268 | 0.038 | 0.003 | 0.013 | 0.160 | 0.170 | 0.260 | 0.262 | 0.157 | 0.000 | 0.021 | 0.059 | 1.000 | 0.506 | 0.245 | 0.064 | 0.233 | 0.000 | 0.000 | 0.154 | 0.280 |
| ROOF_STRUCTURE | 0.174 | 0.055 | 0.221 | 0.138 | 0.187 | 0.138 | 0.124 | 0.265 | 0.219 | 0.000 | 0.146 | 0.170 | 0.259 | 0.061 | 0.011 | 0.258 | 0.000 | 0.077 | 0.089 | 0.117 | 0.160 | 0.115 | 0.172 | 0.136 | 0.066 | 0.023 | 0.289 | 0.000 | 0.279 | 0.044 | 0.000 | 0.010 | 0.149 | 0.124 | 0.265 | 0.258 | 0.131 | 0.000 | 0.017 | 0.038 | 0.506 | 1.000 | 0.203 | 0.060 | 0.247 | 0.000 | 0.003 | 0.146 | 0.269 |
| STRUCTURE_CLASS | 0.146 | 0.364 | 0.138 | 0.307 | 0.313 | 0.413 | 0.389 | 0.259 | 0.341 | 1.000 | 0.046 | 0.284 | 0.596 | 0.099 | 0.065 | 0.247 | 0.102 | 0.089 | 0.374 | 0.103 | 0.178 | 0.338 | 0.225 | 0.298 | 0.128 | 0.030 | 0.172 | 0.096 | 0.307 | 0.191 | 0.000 | 0.039 | 0.370 | 0.286 | 0.199 | 0.247 | 0.324 | 1.000 | 0.122 | 0.205 | 0.245 | 0.203 | 1.000 | 0.084 | 0.188 | 1.000 | 0.062 | 0.229 | 0.258 |
| ST_NUM | 0.032 | 0.035 | -0.146 | 0.060 | 0.039 | 0.040 | 0.021 | 0.176 | 0.066 | 0.145 | 0.062 | 0.044 | 0.075 | 0.000 | -0.112 | -0.058 | -0.089 | 0.071 | 0.035 | -0.094 | 0.046 | 0.028 | -0.125 | 0.051 | 0.007 | 0.016 | 0.049 | -0.065 | 0.067 | 0.073 | 0.000 | -0.103 | 0.088 | 0.021 | 0.086 | -0.058 | 0.029 | 0.055 | -0.088 | 0.120 | 0.064 | 0.060 | 0.084 | 1.000 | -0.149 | 0.028 | -0.028 | 0.016 | -0.058 |
| TT_RMS | 0.195 | 0.127 | 0.940 | 0.122 | 0.207 | 0.271 | -0.216 | 0.220 | 0.135 | NaN | 0.085 | 0.166 | 0.218 | 0.054 | 0.664 | 0.255 | 0.925 | 0.084 | 0.124 | 0.178 | 0.152 | 0.041 | 0.721 | 0.129 | 0.029 | 0.050 | 0.430 | 0.900 | 0.408 | 0.294 | 0.003 | 0.457 | 0.172 | 0.055 | 0.292 | 0.255 | 0.105 | NaN | 0.767 | NaN | 0.233 | 0.247 | 0.188 | -0.149 | 1.000 | -0.176 | 0.282 | 0.125 | 0.255 |
| YR_BUILT | 0.000 | 0.000 | -0.147 | 0.000 | 0.000 | 1.000 | 0.184 | 0.000 | 0.101 | 0.105 | 0.000 | 0.009 | 0.000 | -0.062 | -0.060 | 0.150 | -0.104 | 0.000 | 0.000 | 0.091 | 0.006 | 0.000 | -0.165 | 0.000 | 1.000 | 1.000 | 0.000 | -0.104 | 0.000 | -0.045 | 0.000 | 0.142 | 0.000 | 0.000 | 0.000 | 0.150 | 0.000 | 0.043 | -0.161 | 0.308 | 0.000 | 0.000 | 1.000 | 0.028 | -0.176 | 1.000 | 0.055 | 0.160 | 0.150 |
| YR_REMODEL | 0.006 | 0.000 | 0.277 | 0.006 | 0.000 | 0.008 | -0.070 | 0.021 | -0.023 | NaN | 0.000 | 0.000 | 0.036 | 0.104 | 0.233 | -0.023 | 0.243 | 0.000 | 0.000 | 0.190 | 0.012 | 0.000 | 0.066 | 0.006 | 0.065 | 0.000 | 0.008 | 0.248 | 0.033 | -0.038 | 0.000 | 0.166 | 0.000 | 0.034 | 0.013 | -0.023 | 0.000 | NaN | 0.197 | NaN | 0.000 | 0.003 | 0.062 | -0.028 | 0.282 | 0.055 | 1.000 | 0.007 | -0.023 |
| ZIP_CODE | 0.119 | 0.127 | 0.123 | 0.159 | 0.209 | 0.239 | -0.232 | 0.694 | 0.582 | -0.137 | 0.101 | 0.099 | 0.218 | -0.011 | 0.052 | 0.608 | 0.070 | 0.067 | 0.083 | 0.068 | 0.121 | 0.118 | 0.097 | 0.156 | 0.066 | 0.041 | 0.166 | 0.004 | 0.140 | -0.134 | 0.003 | 0.286 | 0.157 | 0.091 | 0.127 | 0.608 | 0.155 | -0.008 | -0.024 | -0.240 | 0.154 | 0.146 | 0.229 | 0.016 | 0.125 | 0.160 | 0.007 | 1.000 | 0.608 |
| _id | 0.213 | 0.155 | 0.261 | 0.218 | 0.258 | 0.273 | -0.260 | 0.752 | 1.000 | -0.125 | 0.169 | 0.193 | 0.244 | 0.018 | 0.062 | 1.000 | 0.194 | 0.176 | 0.131 | 0.081 | 0.215 | 0.166 | 0.126 | 0.218 | 0.138 | 0.105 | 0.228 | 0.116 | 0.211 | -0.125 | 0.004 | 0.438 | 0.162 | 0.076 | 0.255 | 1.000 | 0.185 | -0.004 | 0.034 | -0.169 | 0.280 | 0.269 | 0.258 | -0.058 | 0.255 | 0.150 | -0.023 | 0.608 | 1.000 |
| _id | PID | CM_ID | GIS_ID | ST_NUM | ST_NAME | UNIT_NUM | CITY | ZIP_CODE | BLDG_SEQ | NUM_BLDGS | LUC | LU | LU_DESC | BLDG_TYPE | OWN_OCC | OWNER | MAIL_ADDRESSEE | MAIL_STREET_ADDRESS | MAIL_CITY | MAIL_STATE | MAIL_ZIP_CODE | RES_FLOOR | CD_FLOOR | RES_UNITS | COM_UNITS | RC_UNITS | LAND_SF | GROSS_AREA | LIVING_AREA | LAND_VALUE | BLDG_VALUE | SFYI_VALUE | TOTAL_VALUE | GROSS_TAX | YR_BUILT | YR_REMODEL | STRUCTURE_CLASS | ROOF_STRUCTURE | ROOF_COVER | INT_WALL | EXT_FNISHED | INT_COND | EXT_COND | OVERALL_COND | BED_RMS | FULL_BTH | HLF_BTH | KITCHENS | TT_RMS | BDRM_COND | BTHRM_STYLE1 | BTHRM_STYLE2 | BTHRM_STYLE3 | KITCHEN_TYPE | KITCHEN_STYLE1 | KITCHEN_STYLE2 | KITCHEN_STYLE3 | HEAT_TYPE | HEAT_SYSTEM | AC_TYPE | FIREPLACES | ORIENTATION | NUM_PARKING | PROP_VIEW | CORNER_UNIT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 100001000 | NaN | 100001000 | 104.0 | PUTNAM ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | RE - Row End | Y | PASCUCCI CARLO | NaN | 195 LEXINGTON ST | EAST BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 1,150 | 3353.0 | 2202.0 | 197,600 | 594,400 | 0 | 792,000 | $8,632.80 | 1900.0 | NaN | NaN | F - Flat | C - Composition | N - Normal | A - Asbestos | A - Average | F - Fair | A - Average | 6.0 | 3.0 | 0.0 | 3.0 | 12.0 | NaN | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | 3F - 3 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 3.0 | A - Average | NaN |
| 1 | 2 | 100002000 | NaN | 100002000 | 197.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | RM - Row Middle | N | SEMBRANO RODERICK | NaN | 197 LEXINGTON ST | EAST BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 1,150 | 3047.0 | 2307.0 | 198,500 | 619,700 | 0 | 818,200 | $8,918.38 | 1920.0 | 2000.0 | NaN | F - Flat | C - Composition | N - Normal | M - Vinyl | A - Average | A - Average | A - Average | 3.0 | 3.0 | 0.0 | 3.0 | 9.0 | NaN | M - Modern | M - Modern | M - Modern | 3F - 3 Full Eat In Kitchens | M - Modern | M - Modern | M - Modern | F - Forced Hot Air | NaN | C - Central AC | 0.0 | NaN | 0.0 | A - Average | NaN |
| 2 | 3 | 100003000 | NaN | 100003000 | 199.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | RM - Row Middle | Y | GUERRA CHEVARRIA ANA S | NaN | 199 LEXINGTON ST | EAST BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 1,150 | 3392.0 | 2268.0 | 199,100 | 605,300 | 0 | 804,400 | $8,767.96 | 1905.0 | 1985.0 | NaN | F - Flat | C - Composition | N - Normal | M - Vinyl | A - Average | G - Good | A - Average | 5.0 | 3.0 | 0.0 | 3.0 | 13.0 | NaN | M - Modern | M - Modern | M - Modern | 3F - 3 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | S - Space Heat | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 3 | 4 | 100004000 | NaN | 100004000 | 201.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | RM - Row Middle | N | JB REALTY TRUST | NaN | PO BOX 557 # | EVERETT | MA | 2149.0 | 3.0 | NaN | NaN | NaN | NaN | 1,150 | 3108.0 | 2028.0 | 199,700 | 535,600 | 0 | 735,300 | $8,014.77 | 1900.0 | 1991.0 | NaN | M - Mansard | C - Composition | N - Normal | M - Vinyl | A - Average | A - Average | A - Average | 5.0 | 3.0 | 0.0 | 3.0 | 11.0 | NaN | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | 3F - 3 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 4 | 5 | 100005000 | NaN | 100005000 | 203.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 104 | R2 | TWO-FAM DWELLING | RE - Row End | Y | MARKS TRAVIS JOSEPH | NaN | 203 Lexington ST | EAST BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 2,010 | 3700.0 | 2546.0 | 230,200 | 501,400 | 0 | 731,600 | $7,974.44 | 1900.0 | 1978.0 | NaN | M - Mansard | C - Composition | N - Normal | M - Vinyl | A - Average | F - Fair | A - Average | 6.0 | 3.0 | 0.0 | 2.0 | 13.0 | NaN | N - No Remodeling | N - No Remodeling | N - No Remodeling | 2F - 2 Full Eat In Kitchens | N - No Remodeling | N - No Remodeling | NaN | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 5 | 6 | 100006000 | NaN | 100006000 | 205.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | DK - Decker | N | 205 LEXINGTON LLC | NaN | 28 LAUDHOLM RD | NEWTON | MA | 2458.0 | 3.0 | NaN | NaN | NaN | NaN | 2,500 | 6278.0 | 4362.0 | 263,800 | 1,037,400 | 0 | 1,301,200 | $14,183.08 | 1900.0 | 2018.0 | NaN | F - Flat | R - Rubber Roof | N - Normal | A - Asbestos | G - Good | A - Average | A - Average | 13.0 | 6.0 | 0.0 | 3.0 | 20.0 | NaN | M - Modern | M - Modern | M - Modern | 3F - 3 Full Eat In Kitchens | M - Modern | M - Modern | M - Modern | E - Electric | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 6 | 7 | 100007000 | NaN | 100007000 | 209.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | DK - Decker | N | YOON SUNG PIL | NaN | -211 209 LEXINGTON ST | EAST BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 2,500 | 6432.0 | 4296.0 | 264,700 | 1,003,200 | 0 | 1,267,900 | $13,820.11 | 1900.0 | 2009.0 | NaN | F - Flat | R - Rubber Roof | N - Normal | A - Asbestos | A - Average | A - Average | A - Average | 14.0 | 5.0 | 0.0 | 3.0 | 20.0 | NaN | M - Modern | M - Modern | M - Modern | 3F - 3 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 7 | 8 | 100008000 | NaN | 100008000 | 213.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | DK - Decker | Y | CASTALDINI ANTONIO | NaN | 213 LEXINGTON ST | E BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 2,500 | 6048.0 | 4080.0 | 265,300 | 885,400 | 0 | 1,150,700 | $12,542.63 | 1900.0 | NaN | NaN | F - Flat | C - Composition | N - Normal | M - Vinyl | A - Average | A - Average | A - Average | 11.0 | 3.0 | 0.0 | 3.0 | 16.0 | NaN | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | 3F - 3 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 8 | 9 | 100009000 | NaN | 100009000 | 215.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | DK - Decker | N | BEAGLEMIKE FAMILY TRUST | NaN | 215 LEXINGTON ST | EAST BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 2,500 | 4339.0 | 2937.0 | 265,900 | 619,300 | 0 | 885,200 | $9,648.68 | 1900.0 | 1998.0 | NaN | F - Flat | R - Rubber Roof | N - Normal | A - Asbestos | A - Average | A - Average | A - Average | 5.0 | 3.0 | 0.0 | 3.0 | 14.0 | NaN | S - Semi-Modern | S - Semi-Modern | M - Modern | 3F - 3 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | M - Modern | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 0.0 | A - Average | NaN |
| 9 | 10 | 100010000 | NaN | 100010000 | 217.0 | Lexington ST | NaN | EAST BOSTON | 2128.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | DK - Decker | N | APPLETON GROVE LLC | C/O LAUREN SCHOENADEL | 217 LEXINGTON ST UNIT 1 | BOSTON | MA | 2128.0 | 3.0 | NaN | NaN | NaN | NaN | 2,500 | 4659.0 | 3241.0 | 226,700 | 811,000 | 0 | 1,037,700 | $11,310.93 | 1900.0 | 2020.0 | NaN | F - Flat | R - Rubber Roof | N - Normal | C - Cement Board | G - Good | G - Good | G - Good | 6.0 | 3.0 | 0.0 | 3.0 | 14.0 | NaN | M - Modern | M - Modern | M - Modern | 3F - 3 Full Eat In Kitchens | M - Modern | M - Modern | M - Modern | W - Ht Water/Steam | NaN | C - Central AC | 0.0 | NaN | 0.0 | A - Average | NaN |
| _id | PID | CM_ID | GIS_ID | ST_NUM | ST_NAME | UNIT_NUM | CITY | ZIP_CODE | BLDG_SEQ | NUM_BLDGS | LUC | LU | LU_DESC | BLDG_TYPE | OWN_OCC | OWNER | MAIL_ADDRESSEE | MAIL_STREET_ADDRESS | MAIL_CITY | MAIL_STATE | MAIL_ZIP_CODE | RES_FLOOR | CD_FLOOR | RES_UNITS | COM_UNITS | RC_UNITS | LAND_SF | GROSS_AREA | LIVING_AREA | LAND_VALUE | BLDG_VALUE | SFYI_VALUE | TOTAL_VALUE | GROSS_TAX | YR_BUILT | YR_REMODEL | STRUCTURE_CLASS | ROOF_STRUCTURE | ROOF_COVER | INT_WALL | EXT_FNISHED | INT_COND | EXT_COND | OVERALL_COND | BED_RMS | FULL_BTH | HLF_BTH | KITCHENS | TT_RMS | BDRM_COND | BTHRM_STYLE1 | BTHRM_STYLE2 | BTHRM_STYLE3 | KITCHEN_TYPE | KITCHEN_STYLE1 | KITCHEN_STYLE2 | KITCHEN_STYLE3 | HEAT_TYPE | HEAT_SYSTEM | AC_TYPE | FIREPLACES | ORIENTATION | NUM_PARKING | PROP_VIEW | CORNER_UNIT | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182232 | 182233 | 2205663001 | NaN | 2205663001 | 20.0 | Lake ST | NaN | BRIGHTON | 2135.0 | 1 | 1 | 101 | R1 | SINGLE FAM DWELLING | SD - Semi-Det | Y | MCGOVERN DENNIS | C/O DENNIS L MCGOVERN | 20 LAKE ST | BRIGHTON | MA | 2135 | 2.0 | NaN | NaN | NaN | NaN | 3,778 | 4240.0 | 2390.4 | 289,500 | 633,400 | 0 | 922,900 | $10,059.61 | 1920.0 | NaN | NaN | H - Hip | S - Slate | N - Normal | S - Stucco | A - Average | A - Average | A - Average | 7.0 | 2.0 | 1.0 | 1.0 | 10.0 | NaN | S - Semi-Modern | S - Semi-Modern | S - Semi-Modern | 1F - 1 Full Eat In Kitchens | S - Semi-Modern | NaN | NaN | W - Ht Water/Steam | NaN | N - None | 2.0 | NaN | 3.0 | A - Average | NaN |
| 182233 | 182234 | 2205664000 | NaN | 2205664000 | 18.0 | Lake ST | NaN | BRIGHTON | 2135.0 | 1 | 1 | 104 | R2 | TWO-FAM DWELLING | CV - Conventional | Y | HOFFMAN ANN MARIE | NaN | 18 LAKE ST | BRIGHTON | MA | 2135 | 2.5 | NaN | NaN | NaN | NaN | 5,333 | 4609.0 | 2951.6 | 365,300 | 750,000 | 0 | 1,115,300 | $12,156.77 | 1920.0 | NaN | NaN | L - Gambrel | A - Asphalt Shingl | N - Normal | M - Vinyl | A - Average | A - Average | A - Average | 5.0 | 2.0 | 0.0 | 2.0 | 11.0 | NaN | M - Modern | S - Semi-Modern | NaN | 2F - 2 Full Eat In Kitchens | S - Semi-Modern | S - Semi-Modern | NaN | F - Forced Hot Air | NaN | N - None | 0.0 | NaN | 3.0 | A - Average | NaN |
| 182234 | 182235 | 2205665000 | 2.205665e+09 | 2205665000 | 14.0 | Lake ST | NaN | BRIGHTON | 2135.0 | 1 | 1 | 995 | CM | CONDO MAIN | FS - Free Standing | N | TWELVE 14 LAKE ST CONDO TR | C/O PI-YAO AILEEN LIU | 110 ALGONQUIN ROAD | CHESTNUT HILL | MA | 2467 | 2.0 | NaN | 2.0 | 0.0 | 0.0 | 4,485 | NaN | NaN | 0 | 0 | 0 | 0 | $- | 1999.0 | NaN | NaN | H - Hip | A - Asphalt Shingl | NaN | M - Vinyl | NaN | A - Average | A - Average | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 182235 | 182236 | 2205665002 | 2.205665e+09 | 2205665000 | 14.0 | Lake ST | 2 | BRIGHTON | 2135.0 | 1 | 1 | 102 | CD | RESIDENTIAL CONDO | FS - Free Standing | N | LAI LIU TRUST | NaN | 110 ALGONQUIN RD | CHESTNUT HILL | MA | 2467 | 2.0 | 2.0 | NaN | NaN | NaN | 2,777 | 2777.0 | 1410.0 | 0 | 545,100 | 0 | 545,100 | $5,941.59 | 1920.0 | NaN | NaN | H - Hip | A - Asphalt Shingl | N - Normal | M - Vinyl | A - Average | A - Average | A - Average | 3.0 | 1.0 | 0.0 | 1.0 | 8.0 | A - Average | M - Modern | NaN | NaN | F - Full Eat In | S - Semi-Modern | NaN | NaN | W - Ht Water/Steam | I - Indiv. Cntrl | N - None | 1.0 | T - Through | 1.0 | A - Average | N - No |
| 182236 | 182237 | 2205665004 | 2.205665e+09 | 2205665000 | 12.0 | Lake ST | 1 | BRIGHTON | 2135.0 | 1 | 1 | 102 | CD | RESIDENTIAL CONDO | FS - Free Standing | N | 137-141 CHISWICK REALTY TRUST | C/O SHIH CHUN | 110 ALGONQUIN RD | NEWTON | MA | 2467 | 1.0 | 1.0 | NaN | NaN | NaN | 1,401 | 1401.0 | 1401.0 | 0 | 494,800 | 0 | 494,800 | $5,393.32 | 1920.0 | NaN | NaN | H - Hip | A - Asphalt Shingl | N - Normal | M - Vinyl | A - Average | A - Average | A - Average | 2.0 | 1.0 | 0.0 | 1.0 | 7.0 | A - Average | S - Semi-Modern | NaN | NaN | F - Full Eat In | S - Semi-Modern | NaN | NaN | W - Ht Water/Steam | I - Indiv. Cntrl | N - None | 1.0 | T - Through | 1.0 | A - Average | N - No |
| 182237 | 182238 | 2205666000 | NaN | 2205666000 | NaN | KNOWLES ST | NaN | BRIGHTON | 2135.0 | 1 | 1 | 902 | E | CITY OF BOSTON | 99 - Vacant | N | CITY OF BOSTON BY FCL | NaN | KNOWLES | BRIGHTON | MA | 2135 | NaN | NaN | NaN | NaN | NaN | 5,931 | NaN | NaN | 240,500 | 0 | 0 | 240,500 | $- | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | A - Average | NaN | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 182238 | 182239 | 2205667000 | NaN | 2205667000 | NaN | Lake ST | NaN | BRIGHTON | 2135.0 | 1 | 1 | 132 | RL - RL | RES LAND (Unusable) | 99 - Vacant | N | GREALISH MARTIN J TS | NaN | 111 HUNTINGTON AV 12TH FLR | BOSTON | MA | 2199 | NaN | NaN | NaN | NaN | NaN | 4,588 | NaN | NaN | 72,800 | 0 | 0 | 72,800 | $793.52 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | A - Average | NaN | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 182239 | 182240 | 2205668000 | NaN | 2205668000 | 4.0 | Lake ST | NaN | BRIGHTON | 2135.0 | 1 | 1 | 105 | R3 | THREE-FAM DWELLING | CV - Conventional | N | EAGLE PROPERTY HOLDINGS LLC | NaN | AIM PROPERTY MANAGEMENT | CAPE NEDDICK | ME | 3902 | 2.5 | NaN | NaN | NaN | NaN | 7,380 | 4291.0 | 2834.4 | 464,400 | 850,500 | 0 | 1,314,900 | $14,332.41 | 1920.0 | 1990.0 | NaN | L - Gambrel | A - Asphalt Shingl | N - Normal | M - Vinyl | G - Good | A - Average | G - Good | 6.0 | 3.0 | 0.0 | 3.0 | 12.0 | NaN | M - Modern | M - Modern | M - Modern | 2F - 2 Full Eat In Kitchens | M - Modern | M - Modern | M - Modern | W - Ht Water/Steam | NaN | N - None | 0.0 | NaN | 2.0 | A - Average | NaN |
| 182240 | 182241 | 2205669000 | NaN | 2205669000 | 2193.0 | COMMONWEALTH AV | NaN | BRIGHTON | 2135.0 | 1 | 1 | 319 | C | STRIP CTR STORES | 319 - STRIP RETAIL/ OFFICE | N | GREALISH MARTIN J TRST | NaN | AIM PROPERTY MANAGEMENT | CAPE NEDDICK | ME | 3902 | NaN | NaN | NaN | NaN | NaN | 12,500 | 14520.0 | 7260.0 | 990,900 | 1,458,800 | 0 | 2,459,200 | $62,143.98 | 1947.0 | 2016.0 | C - Brick/Concr | NaN | NaN | NaN | 01 - Brick | NaN | NaN | G - Good | NaN | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 182241 | 182242 | 2205670000 | NaN | 2205670000 | 2203.0 | COMMONWEALTH AV | NaN | BRIGHTON | 2135.0 | 1 | 1 | 985 | E | OTHER EXEMPT BLDG | 973 - ADMINISTRATIVE BLDG | N | COMMWLTH OF MASS | NaN | 2203 COMMONWEALTH AVE | BRIGHTON | MA | 2135 | NaN | NaN | NaN | NaN | NaN | 34,125 | 7386.0 | 7386.0 | 2,138,600 | 1,342,900 | 0 | 3,489,000 | $- | 1900.0 | NaN | C - Brick/Concr | NaN | NaN | NaN | 03 - Poured Concr | NaN | NaN | A - Average | NaN | 0.0 | 0.0 | 0.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |